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EMISSION RATIOMETRIC INDICATORS OF PHOSPHORYLATION 
BACKGROUND OF THE INVENTION 

Field Of The Invention 

10001] The present invention relates generally to reagents for determining kinase and phosphatase 
activity, and more specifically to chimeric proteins containing two fluorescent proteins and a phosphorylatable 
domain, and methods of using such chimeric proteins to detect kinase or phosphatase activity. 

Description of the Related Art 

[0002] Phosphorylation is the most important way that individual proteins are post-translationally 
modified to modulate their function, while practically all signal transduction involves dynamics of protein-protein 
interaction. Various technologies have been used to enumerate the main phosphorylation/dephosphorylation events 
and interacting protein partners involved in cell function, including, for example, function of cardiomyocytes and B 
lymphocytes. However, the most common currently used technologies such as two dimensional gel electrophoresis, 
mass spectrometry, co-immunoprecipitation assays, and two-hybrid screens require destroying large numbers of the 
cells or transferring genes to heterologous organisms. As such, these methods have poor temporal and spatial 
resolution, and are insufficient to directly probe physiological functions such as contracture or chemotaxis, which 
occur on the time scale of milliseconds to minutes. 

(0003] The most widely used method for detecting phosphorylation of specific proteins in single cells 
utilizes antibodies that discriminate between the phosphorylated and dephosphorylated forms of an antigen. Such 
antibodies can, in principle, reveal the phosphorylation state of the endogenous protein just prior to the time the cells 
were fixed for examination, without any introduction of exogenous substrates. However, the identification of 
antibodies that can discriminate between a phosphorylated and unphosphorylated form of a protein is time consuming 
and expensive. In addition, the necessary immunocytochemistry is tedious, and is difficult to reassemble into a 
quantitative time course. 

(0004] In order to achieve dynamic recording of phosphorylation in single cells, peptides have been 
labeled with acrylodan, a probe whose fluorescence can be sensitive to the phosphorylation of the peptide. For 
example, when acrylodan was attached to a peptide from myosin light chain, an approximately 40% decrease in 
emission peak amplitude upon phosphorylation in vitro was observed. When microinjected into fibroblasts, the peptide 
incorporated into stress fibers, but no dynamic changes were observable. Substrates for CaWIKII and PKA also have 
been labeled with acrylodan and, after exposure to the kinase, fluorescence was about 200% and 97%, respectively, 
of initial values. These peptides were hydrophobic enough to stain live cells, and local intensity changes of up to 10% 
to 20% of initial fluorescence were seen in some regions. The fluorescence of the PKA substrate simultaneously 
decreased in the cytosol and increased in the nucleus by an amount that was greater than could be explained by the in 
vitro sensitivity, indicating that more complex factors such as translocation were dominating. 
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[0005] Although no follow-up studies on the use of these acrylodan-labeled peptides have been 
reported, the approach of developing phosphorylation-sensitive fluorescent substrates may be worth pursuing. 
However, the use of acrylodan-labeled peptides provides no rational mechanism for phosphorylation sensitivity. Thus, 
a need exists for phosphorylation-sensitive indicators that can be used to detect phosphorylation or dephosphorylation 
events in a cell. The present invention satisfies this need and provides additional advantages. 

SUMMARY OF THE INVENTION 

[0006] The present invention relates to a chimeric phosphorylation indicator, which contains, in 
operative linkage, a donor molecule, a phosphorylatable domain, a phosphoaminoacid binding domain, and an acceptor 
molecule, wherein the phosphoaminoacid binding domain specifically binds to a phosphoaminoacid when present in the 
phosphorylatable domain, the donor molecule and the acceptor molecule exhibit a detectable resonance energy 
transfer when the donor is excited, and the phosphorylatable domain and phosphoaminoacid binding domain do not 
substantially emit light to excite the acceptor. The donor molecule or the acceptor or both can be a fluorescent 
protein, or a luminescent molecule, or a combination thereof. In one embodiment, each of the donor molecule and the 
acceptor molecule is a fluorescent protein. In another embodiment one of the donor or acceptor molecule is a 
luminescent molecule and the other is a fluorescent protein. In a third embodiment each of the donor molecule and 
acceptor molecule is a luminescent molecule. 

[0007] Where a chimeric phosphorylation indicator of the invention contains a fluorescent protein 
donor molecule, resonance energy transfer can be detected as fluorescence resonance energy transfer (FRET). Where 
the donor molecule is a luminescent molecule, resonance energy transfer is detected as luminescent resonance energy 
transfer). Depending on the particular structure of the chimeric phosphorylation indicator as disclosed herein, FRET or 
LRET can be increased or decreased due to phosphorylation of the indicator by a kinase, and, likewise, can be 
increased or decreased due to phosphorylation of the indicator by a phosphatase. A change in FRET or LRET can be 
determined by monitoring the emission spectrum of the acceptor. 

10008] A fluorescent protein in a chimeric phosphorylation indictor can be a green fluorescent protein 
(GFP), a red fluorescent protein (RFP), or a fluorescent protein related to a GFP or an RFP, including a non- 
oligomerizing fluorescent protein. An RFP, for example, can be a D/scosoma RFP or a fluorescent protein related to a , 
Discosoma RFP such a D/scosoma DsRed (SEQ ID N0:12) or a mutant thereof (SEQ ID N0:12, including an I125R 
mutation), or a non-oligomerizing tandem DsRed containing, for example, two RFP monomers operatively linked by a 
peptide linker. For example, a non-oligomerizing tandem RFP can contain two DsRed (SEQ ID NO: 12) monomers or 
two mutant DsRed-l125R monomers operatively linked by a peptide having an amino acid sequence as set forth as 
SEQID N0:13. 

[0009] A GFP useful in a chimeric phosphorylation indicator can be an Aequorea GFP, a Renilla GFP, a 
Phialidium GFP, or a fluorescent protein related to an Aequorea GFP, a Renilla GFP, or a Ph/altdium GFP. A fluorescent 
protein related to an Aequorea GFP, for example, can be a cyan fluorescent protein (CFP), or a yellow fluorescent 
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protein (YFP), or a spectral variant of the CFP or YFP, including an enhanced GFP (EGFP; SEQ ID I\I0:4), an enhanced 
CFP (ECFP; SEQ ID N0:6), an ECFPO-227) (amino acids 1 to 227 of SEQ ID N0:6), an EYFP-V68L/Q69K (SEQ ID 
ND:1D), or an enhanced YFP (EYFP; SEQ ID l\!D:8). Furthermore, the fluorescent protein can contain a mutation of an 
amino acid residue corresponding to A206, L221, F223, or a combination thereof of SEQ ID NQ:2, for example, an 
A206K mutation, an L221 K mutation, an F223R mutation, or an L221 K and F223R mutation of SEQ ID l\I0:2; or an 
A206K mutation, an L221 K mutation, an F223R mutation, or an L221 K and F223R mutation of SEQ ID NQ:6 or SEQ 
IDN0-.10. ] 

[001 D] A luminescent molecule useful in a chimeric phosphorylation indicator can be, for example, a 
lanthanide, which can be in the form of a chelate, including a lanthanide complex containing the chelate. Thus, the 
luminescent molecule can be a terbium ion (Tb 3+ ) chelate, for example, a chelate of Tb 3+ and triethylenetetraamine 
hexaacetic acid (TTHA), and can further include carbostyril 124 operatively linked to the Tb 3+ chelate. Where the 
chimeric phosphorylation indicator is to be contacted with a cell, for example, to detect the presence of a kinase or 
phosphatase in the cell, the luminescent molecule can further include a membrane translocating domain such as that 
set forth as SEQ ID N0:18. Such a membrane translocating domain or other molecule to be linked to the luminescent 
molecule can be operatively linked using, for example, a tetracysteine motif such as that set forth in SEQ ID WO: 17. 

1001 1] The phosphorylatable domain in a chimeric phosphorylation indicator of the invention can be 
any molecule that can be phosphorylated by a specific kinase, for example, a serine/threonine kinase or a tyrosine 
kinase, or that can contain a phosphate group and can be dephosphorylated by a specific phosphatase. Thus, the 
phosphorylatable domain can be a synthetic peptide, a peptide portion of a naturally-occurring kinase or phosphatase 
substrate, a peptidomimetic, a polynucleotide, or the like. By way of example, a serine/threonine kinase domain can 
include an amino acid sequence as that set forth in SEQ ID NQ:20 or SEQ ID N0:32, and a tyrosine kinase 
phosphorylatable domain can include an amino acid sequence as that set forth in SEQ ID NQ:23 or SEQ ID N0:25. 

[0012] The phosphoaminoacid binding domain (PAABD) in a chimeric phosphorylation indicator of the 
invention can be an PAABD that specifically binds the particular phosphoaminoacid that is present in the indicator or 
that can be formed due to phosphorylation of the indicator by a kinase. For example, where the phosphorylatable 
domain is a serine/threonine kinase domain, the phosphoaminoacid binding domain is selected such that it can bind a 
phosphoserine or phosphoserine or both, for example, the 14-3-3x (1-232) peptide. Where the phosphorylatable 
domain is a tyrosine kinase domain, the phosphoaminoacid binding domain is selected such that it can bind a 
phosphotyrosine, for example, a Src homology domain-2 (SH2). 

[0013] A chimeric phosphorylation indicator specific for detecting the presence of a serine/threonine 
kinase is exemplified herein by a fusion protein containing, in an orientation from the amino terminus to carboxy 
terminus, an ECFP(1-227) (amino acids 1-227 of SEQ ID N0:6), an MH linker, a 14-3-3x (1-232) phosphoaminoacid 
binding domain, an AGGTGGS (SEQ ID N0:1 9) linker, an LRRASLP (SEQ ID NQ:20) phosphorylatable domain, a 
GGTGGSEL (SEQ ID I\I0:21) linker, and a citrine. Chimeric phosphorylation indicators specific for detecting the 
presence of a tyrosine kinase are exemplified herein by an EGFR indicator containing, in an orientation from the amino 
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terminus to carhoxy terminus, an ECFPU-227) (amino acids 1 to 227 of SEQ ID N0:6) molecule, an SH2 
phosphoaminoacid binding domain from She, a GSHSGSGKP (SEQ ID N0:22) linker, a phosphorylatable domain 
comprising EEEAEYMNMAPQS (SEQ ID N0:23), and citrine; and by a Src indicator containing, in an orientation from 
the amino terminus to carboxy terminus an ECFP( 1-227) (amino acids 1 to 227 of SEQ ID N0:6), an SH2 
phosphoaminoacid binding domain from c-src, GSTSGSGKPGS6EGS (SEQ ID N0:24), a phosphorylatable domain 
comprising EIYGEF (SEQ ID I\I0:25), and citrine. 

[001 4] In one embodiment, the specific amino acid that can be phosphorylated by a kinase in the 
phosphorylatable domain of a chimeric phosphorylation indicator is not phosphorylated, such that the indicator can be 
used to detect the presence of the kinase in a sample. In another embodiment, the specific amino acid that can be 
phosphorylated by a kinase in the phosphorylatable domain of a chimeric phosphorylation indicator is phosphorylated, 
such that the indicator can be used to detect the presence of a phosphatase in a sample. The specific amino acid can 
be any amino acid that can be phosphorylated by a kinase or dephosphorylated by a phosphatase, for example, serine, 
threonine, tyrosine, or a combination thereof. 

[0015] The present invention also relates to a chimeric phosphorylation indicator, which contains a 
phosphorylatable polypeptide and a fluorescent protein. The specific amino acid that can be phosphorylated by a 
kinase in the phosphorylatable polypeptide can be unphosphorylated, such that the indicator can be used to detect a 
kinase activity, or can be phosphorylated, such that the indicator can be used to detect a phosphatase activity. 

[0016] In one embodiment of a chimeric phosphorylation indicator containing a phosphorylatable 
polypeptide and a fluorescent protein, the phosphorylatable polypeptide comprises an N-terminal portion and a C- 
terminal portion, and the fluorescent protein is operatively inserted between the N-terminal portion and C-terminal 
portion of the phosphorylatable polypeptide. The fluorescent protein can be any fluorescent protein, for example, a 
GFP, an RFP, or a fluorescent protein related to a GFP or an RFP, and can be in a circularly permuted form. The 
phosphorylatable polypeptide can be any substrate for a kinase, for example, a tyrosine kinase or a serine/threonine 
kinase, or for a phosphatase. The fluorescent protein can be operatively inserted into any region of the 
phosphorylatable polypeptide, for example, in a hinge region or a turn, provided the ability of the polypeptide to act as 
a substrate is not disrupted. 

[00 1 7] In another embodiment, a chimeric phosphorylation indicator containing a phosphorylatable 
polypeptide and a fluorescent protein further contains a phosphoaminoacid binding domain operatively linked to the 
phosphorylatable polypeptide, wherein the fluorescent protein comprises an N-terminal portion and a C-terminal 
portion, and wherein the phosphorylatable polypeptide and operatively linked phosphoaminoacid binding domain is 
operatively inserted between the N-terminal portion and C-terminal portion of the fluorescent protein. The fluorescent 
protein can be any fluorescent protein, including, a GFP, an RFP, or a fluorescent protein related to a GFP or an RFP. 
For example, the fluorescent protein can be an EYFP, and the phosphorylatable polypeptide and operatively linked 
phosphoaminoacid binding domain can operatively inserted between an amino acid sequence corresponding to amino 
acid positions 145 and 146 of the EYFP or can be substituted for amino acid 145. 
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[001 8] The present invention also relates to polynucleotide encoding chimeric phosphorylation 
indicator, which contains, in operative linkage, a donor molecule, a phosphorylatable domain, a phosphoaminoacid 
binding domain, and an acceptor molecule, wherein the phosphoaminoacid binding domain specifically binds to a 
phosphoaminoacid when present in the phosphorylatable domain, the donor molecule and the acceptor molecule 
exhibit a detectable resonance energy transfer when the donor is excited, and the phosphorylatable domain and 
phosphoaminoacid binding domain do not substantially emit light to excite the acceptor. In addition, the present 
invention relates to a polynucleotide encoding a chimeric phosphorylation indicator containing a phosphorylatable 
polypeptide and a fluorescent protein, wherein the phosphorylatable polypeptide includes an N-terminal portion and a 
C-terminal portion, and the fluorescent protein is operatively inserted between the N-terminal portion and C-terminal 
portion of the phosphorylatable polypeptide. The present invention further relates to a polynucleotide encoding a 
chimeric phosphorylation indicator containing a phosphoaminoacid binding domain operatively linked to a 
phosphorylatable polypeptide and a fluorescent protein, wherein the fluorescent protein includes an N-terminal portion 
and a C-terminal portion, and wherein the phosphorylatable polypeptide and operatively linked phosphoaminoacid 
binding domain is operatively inserted between the N-terminal portion and C-terminal portion of the fluorescent 
protein. 

[0019] Also provided is a vector containing a polynucleotide of the invention, including an expression 
vector, as well as host cells that contain a polynucleotide of the invention or a vector containing such a 
polynucleotide. In one embodiment, a polynucleotide of the invention is operatively linked to an expression control 
sequence, for example, a transcription regulatory element a translation regulatory element or a combination thereof. 
In another embodiment the polynucleotide is operatively linked to a nucleotide sequence encoding a membrane 
translocating domain or a cell compartmentalization domain. 

[0020] The present invention also relates to kits, which contain at least one chimeric phosphorylation 
indicator of the invention, or a polynucleotide encoding such an indicator. A kit of the invention also can contain a 
plurality of different chimeric phosphorylation indicators, or of encoding polynucleotides, as well as a combination 
thereof. Where a kit contains a plurality of different chimeric phosphorylation indicators, the different indicators can 
contain different phosphorylatable domains, or different donor molecules or acceptor molecules or both, or different 
fluorescent proteins, as appropriate to the chimeric phosphorylatable indicator. Where a kit contains a polynucleotide 
encoding a chimeric phosphorylatable indicator, the polynucleotide can be in a vector, or in a host cell, or can be 
operatively linked to one or more expression control sequences. Where a kit contains a plurality of different 
polynucleotides, the polynucleotides can encode a different chimeric phosphorylation indicator, or each can contain 
different expression control sequences, or be contained in different vectors, particularly different expression vectors. 

[0021] The present invention further relates to a method for detecting a kinase or phosphatase in a 
sample. In one embodiment, a method of the invention is performed, for example, contacting the sample with a 
chimeric phosphorylatable indicator, which contains, in operative linkage, a donor molecule, a phosphorylatable 
domain, a phosphoaminoacid binding domain, and an acceptor molecule, wherein the phosphoaminoacid binding domain 
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specifically binds to a phosphoaminoacid when present in the phosphorylatable domain, the donor molecule and the 
acceptor molecule exhibit a detectable resonance energy transfer when the donor is excited, and the phosphorylatable 
domain and phosphoaminoacid binding domain do not substantially emit light to excite the acceptor; exciting the donor 
molecule; and determining a fluorescence or luminescence property in the sample, wherein the presence of a kinase or 
phosphatase in the sample results in a change in the degree of FRET or LRET, thereby detecting the kinase or 
phosphatase in the sample. The change in the degree of FRET or LRET can be an increased amount of FRET or LRET, 
or can be a decreased amount of FRET or LRET, and the change can be indicative of the presence of a kinase in the 
sample, or, where the phosphorylatable domain is phosphorylated prior to contacting the sample with a chimeric 
phosphorylatable indicator, can be indicative of a phosphatase in the sample. 

[0022] In another embodiment, a method for detecting a kinase or phosphatase in a sample is 
performed by contacting the sample with a chimeric phosphorylatable indicator containing a phosphorylatable 
polypeptide and a fluorescent protein, determining a fluorescence property in the sample, wherein the presence of 
kinase or phosphatase activity in the sample results in a change in the fluorescence property as compared to the 
fluorescent property in the absence of a kinase or phosphatase activity, thereby detecting the kinase or phosphatase 
in the sample. The chimeric phosphorylation indicator can containing a phosphorylatable polypeptide that includes an 
N-terminal portion and a C-terminal portion, such that the fluorescent protein is operatively inserted between the 
N-terminal portion and C-terminal portion of the phosphorylatable polypeptide; or the chimeric phosphorylation 
indicator can contain a phosphoaminoacid binding domain operatively linked to a phosphorylatable polypeptide, which 
is operatively inserted between an N-terminal portion and a C-terminal portion of the fluorescent protein. 

10023] The sample to be examined for kinase activity can be any sample, including, for example, a 
sample containing a synthetic product to be examined for kinase or phosphatase activity. In one embodiment, the 
sample is a biological sample, which can be cell, tissue or organ sample, or an extract of such a sample. In another 
embodiment, the method is performed on an intact cell, which can be in cell culture or can be in a tissue sample. For 
such a method, the chimeric phosphorylatable indicator can contain a targeting sequence such as a cell 
compartmentalization domain that can target the chimeric phosphorylatable indicator to cytosol, endoplasmic 
reticulum, mitochondrial matrix, chloroplast lumen, medial trans-Golgi cisternae, a lumen of a lysosome, or a lumen of 
an endosome. A membrane translocating domain can be a particularly useful cell compartmentalization domain is a 
membrane translocating domain, which can facilitate translocation of the chimeric phosphorylation indicator into an 
intact cell. 

[0024] The phosphorylatable polypeptide in a chimeric phosphorylation indicator comprising a 
fluorescent protein and a phosphorylatable polypeptide can be unphosphorylated or phosphorylated at an amino acid 
position specific for a kinase or a phosphatase, depending on whether the method is for detecting a kinase or 
phosphatase. A method of the invention also can be used to detect an absence of kinase or phosphatase activity in 
the sample, for example, due to the presence of a kinase inhibitor or phosphatase inhibitor. 
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[0025] The present invention also relates to a method for detecting a kinase inhibitor or phosphatase 
inhibitor. Such a method can be performed, for example, by determining a first fluorescence property of a chimeric 
phosphorylatable indicator in the presence of a kinase or a phosphatase, contacting the chimeric phosphorylatable 
indicator with a composition suspected of being a kinase inhibitor or a phosphatase inhibitor, determining a second 
fluorescence property of a chimeric phosphorylatable indicator in the presence of the composition, wherein a 
difference in the first fluorescence property and second fluorescence property identifies the composition as a kinase 
inhibitor or phosphatase inhibitor. Such a method is particularly adaptable to high throughput screening methods and, 
therefore, provides a means to screen libraries of compounds to identify a composition that acts as a kinase inhibitor 
or a phosphatase inhibitor. Accordingly, the present invention also provides a kinase inhibitor or a phosphatase 
inhibitor identified by such method. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10026] Figures 1 A to 1 C illustrate three generic designs for phosphorylation-specif ic chimeric reporter 
proteins, which change fluorescence upon phosphorylation. The fluorescent GFPs, "CFP n and "YFP", are indicated, as 
are the phosphoaminoacid binding domain (PAABD). The larger circle in the PAABD indicates the phosphate-binding 
site, which is rich in Arg and Lys residues, and the smaller circle within the larger circle indicates phosphoaminoacids 
(see Figure 6B). The kinase/phosphatase substrate peptide also indicated ("substrate peptide"), and includes any 
spacers present in the construct. In-pointing and out-pointing arrows indicate excitation and emission maxima; 
respectively, for the GFPs, though the actual spectra are broader than the specific numbers shown in the illustration. 

10027] Figure 1 A illustrates a CFP-PAABD-substrate-YFP chimeric reporter protein, in which 
phosphorylation of the substrate can increase FRET. 

[0028] Figure 1 B illustrates a CFP-substrate-YFP-PAABD chimeric reporter protein, in which 
phosphorylation can decrease FRET. 

[0029] Figure 1 C illustrates a YFP(l-l44)-peptide-PAABD-YFP(146-238) chimeric reporter protein, in 
which phosphorylation can modulate the YFP protonation state and emission intensity (shown here as an increase). 

[0030] Figures 2A to 2C show the structures of various phosphoaminoacid-binding domains 
complexed to phosphorylated peptides, fused together and bracketed by CFP and YFP to form chimeric indicators as 
illustrated in Figure 1 A. The dark gray lines represent the protein, with a few key residues involved in binding the 
peptide shown in stick form, and the phosphopeptide shown in ball-and-stick representation. Heavy arrows indicate 
linkers that can connect the protein to the peptide or either of them to CFP or YFP (the arrow direction indicates 
amino to carboxy). The required length in real space of the linker between protein and peptide is indicated. By 
rearrangement of these linkers (not shown), indicators of the generic structures illustrated in Figures 1B and 1C can 
be constructed analogously. 
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[0031] Figure 2A shows the SH2 domain from phospholipase C- complexed to a phosphopeptide (D- 
(pY)-IIPLPD; SEQ ID N0:14) from PDGF receptor (Pascal etaL, £fc// 77:46 1472, 1994, which is incorporated herein 
by reference). This mitten-shaped SH2 domain was the model for the PAABD shown in Figure 1. 

[0032] Figure 2B shows the PTB domain from She complexed with a phosphopeptide HIIENPQ-(pY)-F 
(SEQ ID IM0:1 5) from TrkA (Zhou et al., Nature 378:584-592, 1 995, which is incorporated herein by reference). 

[0033] Figure 2C shows the 14-3-3 domain complexed with the phosphopeptide ARSH-(pS)-YPA 
(SEO ID N0:16; Yaffe £e//91:961-971, 1997, which is incorporated herein by reference). 

[0034] Figure 3 provides a schematic structure of a circularly permuted GFP (cpGFP; cylinder) 
inserted within a protein (clamshell) whose conformation changes upon phosphorylation. The tube (arc) at the top of 
the cpGFP cylinder indicates a spacer linking the original N-terminus and C-terminus of GFP. Linkers connecting the 
new l\I-terminus and C-terminus of the cpGFP to the insertion site within the phosphorylatable protein are indicated by 
tubes. The N-terminus and C-terminus of the chimera are the same as those of the phosphorylatable protein alone, 
and also are indicated by tubes. The circle in the protein indicates the phosphorylated amino acid. In this example, 
phosphorylation favors a closed conformation, which pries open a cleft in the cpGFP, diminishing cpGFP fluorescence. 

10035] Figures 4A and 4B illustrate detection by resonance energy transfer of heterodimer formation 
between proteins X and Y. 

[0036] Figure 4A shows GFP fused to X and the coral RFP fused to Y. Proximity of X and Y 
promotes fluorescence resonance energy transfer (FRET) from GFP to RFP. 

[0037] Figure 4B shows a terbium chelate ("Tb 3+ chelate"), with a carbostyril antenna ("carbostyril), 
attached to X via a biarsenical ligand ("biarsenical"), binding to a "tetracysteine motif" fused to or inserted within X. 
Proximity of X and Y promotes luminescence resonance energy transfer (LRET) from Tb a+ to RFP. Detailed structures 
of the chelate, antenna, and ligand are provided in Figure 5A (note that they are much smaller (1.4 kDa total) than GFP 
(27 kDa)). In-pointing and out-pointing arrows are as in Figure 1. 

[0038] Figures 5A and 5B illustrates labeling of live cells using an engineered construct. 

[0039] Figure 5A provides a detailed structure of the carbostyril 1 24-Tb 3 *- 
triethylenetetraaminehexaacetic acid biarsenical ligand (TTHA)-Antennapedia peptide conjugate for intracellular 
labeling in live cells of proteins engineered to contain a tetracysteine motif ( -Cys-Cys-X-X-Cys-Cys-.; SEQ ID N0:1 7; 
Griffin et ah, Science 281:269-272, 1 998, which is incorporated herein by reference). The constituent units 
corresponding to those indicated in Figure 4B. After passive internalization mediated by the membrane translocating 
peptide (SEQ ID NQ:18), the latter is released by S-S bond cleavage in the reducing environment of the cytosol 
(Derossi etaL, Trends Cell Biol., 8:84-87, 1998, which is incorporated herein by reference). The tetracysteine motif 
displaces two molecules of 1,2-ethanedithiol, which is membrane-permeant. 

[0040] Figure 5B shows the spectral overlap between carbostyril 124 -Tb 3+ -TTHA emission (dotted 
curve, from Li and Selvin, J. Am. Chem. Soc, 1 17:8132-8138, 1995, which is incorporated herein by reference) and 
coral RFP excitation and emission (solid and dash-dotted curves, respectively). The spikes in the solid and dash-dotted 
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curves near 560 nm and 580 nm are scattering artifacts. As desirable for LRET, the dotted trace overlaps well with 
the solid trace, but very little with the dash-dotted trace. 

[00411 Figures 6 A and 6B illustrate the ratiomeric indicator for visualizing serine/threonine 
phosphorylation. 

(0042) Figure 6A illustrates how FRET between GFPs can report phosphorylation and 
dephosphorylation. The ECFP(V227) ("CFP"; amino acids 1 to 227 of SEQ ID N0:6) and citrine ("YFP"; SEQ ID 
NQ:10, except containing a Q69M mutation) are shown as cylinders, reflecting their crystal structures. The 
phosphoaminoacid binding protein is. shown, with a circle inside representing the phosphate-binding site rich in Arg and 
Lys residues (see Figure 6B). The kinase/phosphatase substrate peptide domain, including linkers, also is indicated. 
The smaller circle within the phosphate-binding site (larger circle) represents phosphoaminoacids, as indicated. In- 
pointing and out-pointing arrows with respect to the GFPs indicate excitation (433 nm) and emission (476 nm) 
maxima, respectively. Upon phosphorylation by protein kinase, the adjacent phosphoaminoacid binding protein binds 
to the phosphorylated substrate peptide, which changes the efficiency of FRET between the GFP mutants. 

[0043] Figure 6B illustrates the domain structure of the PKA reporter, including, from N-terminus to 
C-terminus, ECFP(1-227) (amino acids 1 to 227 of SEQ ID l\IO:6), linker 1 (MH), 14-3-3t (1-232), linker 2 (SEQ ID 
NO: 19), the kemptide substrate (SEQ ID N0:20), linker 3 (SEQ ID N0:21), and citrine. 

[00441 Figures 7A and 7B illustrate the structures of the chimeric EGFR (Figure 7A) and Src (Figure 
7B) reporter proteins. 

[0045] Figure 7A shows, from N-terminus to C-terminus, the enhanced cyan fluorescent protein (CFP) 
domain, the SH2 domain from She, a linker peptide (SEQ ID N0:22), the substrate domain, including a peptide portion 
showing the phosphorylation site (SEQ ID f\IO:23), and the enhanced yellow fluorescent protein domain (YFP). 

[0046] Figure 7B shows, from N-terminus to C-terminus, the enhanced cyan fluorescent protein (CFP) 
domain, the SH2 domain from c-src, a linker peptide (SEQ ID l\I0:24), the substrate domain, including a peptide portion 
showing the phosphorylation site (SEQ ID NG:25), and the enhanced yellow fluorescent protein domain (YFP). 

DETAILED DESCRIPTION OF THE INVENTION 
[0047] The present invention provides compositions and methods that are generally useful for non- 
destructive^ detecting and monitoring protein kinase and phosphatase activities and protein-protein interactions in 
individual living eukaryotic cells, including mammalian cells, and provide a means to obtain spatial and temporal 
resolution on the order of a few micrometers and seconds, or better. As disclosed herein, protein kinase and 
phosphatase activities can be imaged using chimeric substrates (chimeric phosphorylation indicators) that incorporate 
reporter molecules such as fluorescent proteins or luminescent complexes, whose properties change significantly as a 
function of the phosphorylation state of the substrate. Protein interactions are detected by resonance energy transfer 
using fluorescent proteins or lanthanide complexes to label the putative partners. The compositions of the invention 
are adaptable to modification using methods such as high throughput combinatorial generation and screening 
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techniques and, therefore, readily can be varied to allow monitoring of any desired kinase, phosphatase, or protein 
interaction. Accordingly, the present invention provides chimeric phosphorylation indicators having various structures 
as disclosed herein. 

10048] In a first embodiment, a chimeric phosphorylation indicator contains, in operative linkage, a 
donor molecule, a phosphorylatable domain, a phosphoaminoacid binding domain, and an acceptor molecule, wherein 
the phosphoaminoacid binding domain specifically binds to a phosphoaminoacid when present in the phosphorylatable 
domain, the donor molecule and the acceptor molecule exhibit a detectable resonance energy transfer when the donor 
is excited, and the phosphorylatable domain and phosphoaminoacid binding domain do not substantially emit light to 
excite the acceptor. In a second embodiment, a chimeric phosphorylation indicator contains a phosphorylatable 
polypeptide and a fluorescent protein. In one aspect of the second embodiment, the phosphorylatable polypeptide 
comprises an N-terminal portion and a C-terminal portion, and the fluorescent protein is operatively inserted between 
the N-terminal portion and C-terminal portion of the phosphorylatable polypeptide. In a second aspect of the second 
embodiment, a phosphoaminoacid binding domain is operatively linked to the phosphorylatable polypeptide, the 
fluorescent protein comprises an N-terminal portion and a C-terminal portion, and the phosphorylatable polypeptide 
and operatively linked phosphoaminoacid binding domain is operatively inserted between the N-terminal portion and C- 
terminal portion of the fluorescent protein. As disclosed herein, the chimeric phosphorylatable indicators of the 
invention are useful for detecting kinase or phosphatase activity, including in a cell. 

10049] Previous techniques for imaging protein heterodimerization in single cells have included 
observing luminescence resonance energy transfer (LRET) from a lanthanide donor attached to an antibody against 
one member of the heterodimer to a red dye attached to an antibody against the other partner (Root, Proc. Natl. Acad. 
ScL, USA 94:5685-5690, 1997). This approach has the same advantages and disadvantages as phosphorylation- 
specific antibodies, including it is applicable to examining endogenous proteins in intact nontransfected tissues, but 
has poor time resolution and difficulty in generating a continuous time course. Another mode of energy transfer is 
bioluminescence resonance energy transfer, in which the donor is a lucif erase and the acceptor is a GFP (Xu etai, 
Proc. Natl. Acad. ScL, USA 96: 1 5 1 • 1 56, 1 999). However, this method has thus far only been demonstrated in 
bacteria, and the feebleness of bioluminescence will make the technique difficult or impossible to use with single 
mammalian cells, especially if high spatial or temporal resolution is desired. 

{0050] Protein complementation assays, in which each potential partner is fused to a fragment of a 
reporter protein, have a much greater dynamic range between fully interacting and noninteracting states because the 
reporter enzyme fragments are completely dead before reassembly. In addition, in energy transfer methods, the donor 
and acceptor do not have to touch each other, and the response is instantaneous, fully reversible, and has minimal 
effect on the kinetics or affinity of the partners' binding. Such assays also can be complementary to spectroscopic 
techniques as described below. 

[0051] FRET-based (cameleon-type) strategies is an established strategy for monitoring certain 
conformational changes in a natural or chimeric proteins. In a FRET-based strategy, the protein of interest is 

-10- 



WO 02/095058 



PCT/US02/16955 



sandwiched between GFP mutants that are capable of FRET (T sien r Ann. Rev. Biochem., 67:509*544, 1998; Heim, 
Meth. Enzymol, 302:408-423, 1999, each of which is incorporated herein by reference}. Changes in conformation of 
the central protein affect the distance or relative orientation between the flanking GFPs, altering the efficiency of 
FRET, which can readily be imaged in single cells. This strategy has been successfully employed to monitor 1) 
protease-mediated cleavage of peptide linkers; 2) Ca 2+ -induced binding of calmodulin to peptides from myosin light 
chain kinase ; 3) Zn 2+ binding to the Zn 2+ finger from the transcription factor zif 268; 4) cAMP-induced dissociation of 
the regulatory and catalytic subunits of PKA (Zaccolo et at., Nat. Cell Biol., 2:25-29, 2000); 5) cGMP-mediated 
changes in the conformation of cGMP-dependent protein kinase; and 6) agonist and antagonist binding to nuclear 
hormone receptors. 

10052] As disclosed herein, the FRET strategy has been extended to monitor phosphorylation of a 
consensus substrate for a kinase (Pearson and Kemp, Meth. Enzymol, 200:62-81, 1991; Kemp and Pearson, Trends 
Biochem. ScL, 15:342-346, 1990, each of which is incorporated herein by reference) or phosphatase (Blumenthal, in 
"Peptides and Protein Phosphorylation" (Kemp, ed; CRC Press 1990), pages 135-143, which is incorporated herein by 
reference). Such a chimera (Figure 1 A) can include (in order from N-terminus to C -terminus) a GFP mutant such as 
ECFP, a phosphorylatable domain, a domain capable of binding the phosphoaminoacid in the peptide, and a second GFP 
mutant such as a pH-insensitive EYFP (see Figure 1 A; see, also, Miyawaki etaL, Proc. Natl. Acad. ScL, USA 96:2135- 
2140, 1999, which is incorporated herein by reference). Phosphorylation of the substrate domain permits the 
formation of an intramolecular complex with the operatively linked phosphoaminoacid binding domain (PAABD), 
thereby allowing for an increased FRET between the two fluorescent proteins. 

[0053] The modular nature of the chimeric phosphorylation indicators facilitates the insertion or 
substitution of various components, as desired. For example, a Discosoma RFP, DsRed, can be substituted for EYFP 
and ECFP can be substituted with EGFP in the structure exemplified in Figure 1 A. In addition, the ordering of the 
donor and acceptor molecules can be interchanged, as can the order of the phosphorylatable domain and the PAABD. 
In addition, the junctions between the components can be varied using peptide linkers such as those exemplified herein 
to optimize the responsiveness of the indicators. 

10054] A fluorescent protein useful in a constructing a chimeric phosphorylation indicator of the 
invention can be any fluorescent protein, including, for example, a green fluorescent protein (GFP) such as an 
Aequorea victoria GFP, a Renilla renif ormis GFP, a Phialidium gregarium GFP; a red fluorescent protein (RFP) such as a 
Discosoma RFP; or a fluorescent protein related to a GFP or an RFP, such as a cyan fluorescent protein (CFP), a 
yellow fluorescent protein (YFP), an enhanced GFP (EGFP; SEQ ID N0:4), an enhanced CFP (ECFP; SEQ ID I\I0:6), an 
enhanced YFP (EYFP; SEQ ID NQ:8), a DsRed fluorescent protein (SEQ ID NO: 12), citrine, which has an amino acid 
sequence as set forth in SEQ ID NO: 10, except containing a Q69M mutation; or a mutant or variant of such 
fluorescent proteins. 

[0055] A fluorescent protein useful in a constructing a chimeric phosphorylation indicator of the 
invention also can be a non-oligomerizing fluorescent protein, in which the propensity of the fluorescent protein to 
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oligomerize is reduced or eliminated. The propensity of a fluorescent protein to oligomerize can be reduced or 
eliminated by operatively linking a first monomer of a fluorescent protein to at least a second monomer of the 
fluorescent protein, thereby forming an intramolecular 'dimer', Primer' or the like. Such operatively linked 
homopolymers, which have a substantially reduced ability to form intermolecular oligomers, are exemplified by two 
monomers of DsRed (SEQ ID NO: 12) operatively linked by a peptide linker [SEQ ID NO: 13), and by two monomers of a 
mutant DsRed, which has an amino acid sequence of SEQ ID NQ:12, and including an I125R mutation, operatively 
linked by the peptide linker of SEQ ID NO: 13. The propensity of a fluorescent protein to oligomerize also can be 
reduced or eliminated by introducing one or more mutations into the fluorescent protein. Such mutations are 
exemplified by a mutation of one or a combination of amino acid residues A206, L221 or F223 tiAequorea GFP (SEQ 
ID N0:2), or a mutation of another fluorescent protein that corresponds to a mutation of A206, L221 or F223 of SEQ 
ID N0:2 7 for example, by the mutations A206K, L221K, F223R of GFP (SEQ ID N0:2), or by the mutations L221K and 
F223R of ECFP (SEQ ID N0:6) and EYFP-V68L/Q69K (SEQ ID N0:1Q), which are spectral variants tiAequorea GFP. 

[0056] Aequorea GFP is widely used in cell biology as a protein module that can be fused to host 
proteins to make the latter fluorescent (Tsien, Ann. Rev. Biochem., 67:509-544, 1998, which is incorporated herein 
by reference). For example, GFP is commonly used to characterize subcellular localization and trafficking properties of 
proteins, to which the GFP is fused. In addition, spectral variants of GFP, including CFP and YFP and variants thereof 
have been used to measure the associative properties of host proteins by FRET. FRET between CFP and YFP also has 
been exploited to create biosensors for calcium ion, and to determine the associative properties of growth factor 
receptors and G protein-coupled receptors. 

[0057) The availability of a wide range of variously-colored "spectral mutants" of GFP has provided a 
potential means for monitoring the associative properties of proteins via FRET. FRET is a quantum mechanical 
phenomenon of radiation-less energy transfer between two f luorophores, that is dependent on the proper spectral 
overlap of a donor and an acceptor, their distance from each other, and the relative orientation of the chromophores' 
transition dipoles. Using standard molecular biology technology, fusions can be generated between proteins of 
interest and spectral mutants of fluorescent proteins, which can then serve effectively as donor and acceptor FRET 
partners. As indicated above, the GFP spectral mutants have most of the requisite properties to serve as useful FRET 
partners, except for their homoaffinity and propensity for dimerization. Thus, while the number of FRET-based assays 
using GFP and its variants is increasing (see, for example, Mitra eta/., Gene 173:13-17, 1996; Hartman and Vale, 
Science 286:782-785, 1999; Zacharias et al., Curr. Opm. NeurobioL, 10:416421, 2000), the propensity of the GFP- 
related fluorescent proteins to associate with each other can complicate characterization of protein associations 
reported by FRET, which should be due solely to interactions of the proteins with no participation from the 
fluorophore to which they are linked. 

(0058) A luminescent molecule also can be useful in a chimeric phosphorylation indicator. For 
example, a lanthanide, which can be in the form of a chelate, including a lanthanide complex containing the chelate. 
Thus, the luminescent molecule can be a terbium ion (Tb 3+ ) chelate, for example, a chelate of Tb 3+ and 
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triethylenetetraamine hexaacetic acid (TTHA), and can further include carbostyril 124 operatively linked to the 
Tb 3+ chelate. Where the chimeric phosphorylation indicator is to be contacted with a cell, for example, to detect the 
presence of a kinase or phosphatase in the cell the luminescent molecule can further include a membrane 
translocating domain such as that set forth as SEQ ID N0:18. Such a membrane translocating domain or other 
molecule to be linked to the luminescent molecule can be operatively linked using, for example, a tetracysteine motif 
such as that set forth in SEQ ID NO: 17. 

[0059] Lanthanide chelates are ideal donors for luminescence resonance energy transfer (LRET) to 
RFPs such as DsRed, and provides four major advantages over conventional FRET {Selvin, IEEE J. Sel. Top. Quant 
Electron. 2:1077-1087, 1996; Li and Selvin,./. Am. Chem. Soc. 1 17:8132-8138, 1995, each of which is incorporated 
herein by reference): 1) Because the emission spectrum of lanthanide chelates consists of sharp lines rather than 
broad peaks, it is easy to separate the donor lanthanide emission from that of any conventional acceptor. 2) Because 
the LRET-sensitized emission from the acceptor has millisecond kinetics, it can be easily discriminated from directly- 
excited emission, which has nanosecond kinetics. These two advantages mean that even very low degrees of LRET 
(i.e., interactions at distances much greater than the Forster distance, Ro) can be detected. 3) The background 
autofluorescence also has lifetimes in nanoseconds and can likewise be easily eliminated by pulsed excitation and 
gated detection. Thus, much lower concentrations of probes and labeled proteins should be sufficient. 4) The 
lanthanide emission is essentially unpolarized, so that the dependences of LRET on the orientations of the donor and 
acceptor are respectively eliminated and greatly reduced. 

[0060] A problem with intracellular LRET is that few methods have been established for introducing 
extremely polar lanthanide chelates into intact cells and site-specifically labeling proteins there. Nonpolar lanthanide 
chelates such as Tris (3-p-diketonates) are available, but they lack sufficient kinetic stability for use in the presence 
of intact proteins and cells. Known chelates that are stable enough are extremely hydrophilic polyaminocarboxylates, 
which have never been demonstrated to cross cell membranes unaided. As such, the lanthanide complexes used in the 
chimeric phosphorylation indicators of the invention can be conjugated to a membrane translocating peptide such as 
the Antennapedia hydrophobic signal peptides (Rojas etal, Nat Biotechnoi, 16:370-375, 1998, which is 
incorporated herein by reference), VP22 of herpes virus (Elliot and O'Hare, Cell 88:223-233, 1997, which is 
incorporated herein by reference), or the TAT protein, which can mediate import of protein multimers and large, 
polyanions such as antisense nucleic acids of at least 55 bases (Derossi et at, Trends Cell Biol, 8:84-87, 1998, 
which is incorporated herein by reference). 

[0061 1 The site-specific attachment of the lanthanide chelates to recombinant proteins in intact cells 
ca be accomplished by linking the lanthanide complex to a biarsenical reagent, which can selectively find and bind a 
particular tetracysteine motif engineered into a host protein (Griffin et at, Science 281 -.269-272, 1 998, which is 
incorporated herein by reference). Such a structure is exemplified in Figure 5A, wherein one end of a Tb 3+ chelate is 
attached to the sensitizer or antenna chromophore, e.g., a carbostyril dye (Li and Selvin, supra, 1995), and the other 
end is attached to the biarsenical reagent and the membrane-translocating peptide via the simplest heterotrifunctional 
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linker, L-cysteine. The biarsenical reagent is not a fluorescent dye, as previously described, so that it does not quench 
or interfere with the sensitizer or Tb 3+ , and such that the fluorescein moiety is methylated twice to force it into the 
colorless spiro form. The linkage to the membrane-translocating peptide (in this case penetratin-1 from Antennapedia; 
Derossi etai, supra, 1998) is via a disulfide bond, which will be cleaved in the reducing environment of the cytosol. 
Thus the translocating peptide will be discarded before the biarsenical unit attaches the Tb 3+ compIex to the 
tetracysteine-containing host protein. 

[0062] DsRed or a mutant thereof can be an acceptor molecule for LRET from the Tb 3+ chelate, as 
the spectral overlap between Tb 3 * emission and the RFP excitation spectrum is very good (see Figure 5B). The 
Tb 3 *chelate is excited via the carbostyril antenna either with a pulsed nitrogen laser at 337 nm or a mercury-xenon 
flashlamp at 365 nm. Emissions from both the Tb 3+ donor and the RFP acceptor are collected in time-resolved 
manner, which is easy because the relevant time scale is milliseconds. If there is a significant amount of Tb 3+ donor 
very far from the RFP acceptor, because either the donor has not found its target protein or the latter is not bound to 
its partner bearing the RFP, the corresponding emission will have a very long lifetime characteristic of free Tb 3+ 
complex. When the Tb 3+ complex and the RFP are associated, the lifetime of the Tb 3+ complex is somewhat 
shortened, whereas the RFP develops a component of similar lifetime. Both components can be quantified, as they 
are easily separated due to the narrow emission lineshape of the Tb 3 \ 

[0063] The phosphorylatable domain in a chimeric phosphorylation indicator of the invention can be 
any molecule that can be phosphorylated by a specific kinase, for example, a serine/threonine kinase or a tyrosine 
kinase, or that can contain a phosphate group and can be dephosphorylated by a specific phosphatase. Thus, the 
phosphorylatable domain can be a synthetic peptide, a peptide portion of a naturally-occurring kinase or phosphatase 
substrate, a peptidomimetic, a polynucleotide, or the like. By way of example, a serine/threonine kinase domain can 
include an amino acid sequence as that set forth in SEQ ID l\IO:20 or SEQ ID N0:32, and a tyrosine kinase 
phosphorylatable domain can include an amino acid sequence as that set forth in SEQ ID N0:23 or SEQ ID N0:25. 

[0064] The phosphoaminoacid binding domain (PAABD) in a chimeric phosphorylation indicator of the 
invention can be an PAABD that specifically binds the particular phosphoaminoacid that is present in the indicator or 
that can be formed due to phosphorylation of the indicator by a kinase. For example, where the phosphorylatable 
domain is a serine/threonine kinase domain, the phosphoaminoacid binding domain is selected such that it can bind a 
phosphoserine or phosphoserine or both, for example, the 14-3-3x (1-232) peptide. Where the phosphorylatable 
domain is a tyrosine kinase domain, the phosphoaminoacid binding domain is selected such that it can bind a 
phosphotyrosine, for example, a Src homology domain-2 (SH2). 

10065] The SH2 or PTB domains, which bind phosphotyrosine in certain contexts, are the best known 
and most well understood PAABDs. For example, the MWIR structure of an SH2 domain from phosphoIipase-C- 
complexed with a phosphopeptide from the PDGF receptor (Pascal and Singer, supra, 1994) suggests these two 
modules can be fused and sandwiched between a donor and acceptor molecule such as CFP and YFP (see Figure 2A). 
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Similarly, the structure of a PTB domain from She binding a partner peptide from TrkA (Zhou eta/., supra, 1 995) 
indicates that these elements also can be used in a chimeric phosphorylation indicator (see Figure 2B). 

[0066] Phosphoserine/phosphothreonine binding domains also have been described, including those of 
the 14-3-3 proteins. For example, 14-3-3 recognizes a consensus sequence R(R/K)(F/R/S/N)(R/H/K) pSer (W/Y/F/UP 
(SEQ ID N0:42; Yaffe etai, supra, 1997) that can encompass consensus sites for several important protein kinases 
such as PKA, PKG, and PKC (Pearson and Kemp, supra, 1991; Kemp and Pearson, supra, 1990). Figure 2C 
exemplifies of how kinase activity can be reported using a chimeric phosphorylatable indicator comprising a 14-3-3 
protein fused to a domain that is phosphorylatable on a Ser residue. 

[0067] As disclosed herein, PAABDs also can be identified, for example, by screening phage display 
libraries using binding to immobilized phosphoaminoacids for affinity selection, or can be evolved from known 
orthophosphate-binding or sulf ate-binding proteins, or from phosphatases by reduction of their catalytic activity. A 
simple example of the latter approach was the replacement of a catalytic aspartate residue in the tyrosine 
phosphatase, PTP1B, thus converting it into a "substrate-trap", i.e., a high-affinity binder of phosphotyrosine- 
containing peptides with a very slow hydrolytic rate (Flint etai, Proc. Natl Acad. Scl, USA 94:1680-1 685, 1997, 
which is incorporated herein by reference). It should be recognized that a small residual phosphatase activity can be 
quite beneficial because it can provide a backup mechanism to gradually reset the chimeric phosphorylation indicator 
back to the nonphosphorylated state, if it is not a substrate for an endogenous phosphatase. 

[0068] As disclosed herein, FRET can increase or decrease upon phosphorylation or 
dephosphorylation of a chimeric phosphorylation indicator. For example, using chimeras of GFP-CaM binding peptide- 
BFP-permuted CaM, FRET from BFP to GFP was strong in the absence of Ca2+, presumably because the CaM-binding 
peptide was flexible enough to allow the two fluorescent proteins to dimerize, whereas, upon elevation of Ca2+, 
dimerization and FRET were disrupted, perhaps because (Ca2+)4-CaM binding forced the peptide into a more 
extended conformation (Perschini etai, Cell Calcium 22:209-216, 1997, which is incorporated herein by reference). 
As such, substitution of the CaM-binding peptide and CaM by a phosphorylatable domain and a PAABD can be used to 
generate sensors in which kinase (or phosphatase) activity decreases FRET (see Figure IB), the having an opposite 
response from the cameleon-type indicators described above (see, also, Figure 1 A). Similarly, the connectivity of the 
spacers in the structures shown in Figure 2 can be modified to generate such Perschini-type chimeric phosphorylatable 
indicators. 

[0069] The present invention also provides a chimeric phosphorylation indicator containing a 
phosphorylatable polypeptide and a fluorescent protein. These indicators were modeled on chimeras in which a CaM 
or a Zn-finger motif was inserted in place of Y 145 of EYFP, whereupon binding of Ca2+or Zn2+ enhanced the 
fluorescence by 7 or 1.7-fold respectively (Baird eta/., Proc. Natl. Acad. ScL, USA 96:1 1241-1 1246, 1999, which is 
incorporated herein by reference). A GFP with p-lactamase inserted between residues 172 and 173 increased 
fluorescence -1.5 fold upon addition of (3-lactamase-inhibitory protein (Selvin, supra, 1996). The application of this 
strategy to phosphorylation detection is exemplified in Figure 1C, wherein a phosphorylatable peptide and PAABD are 
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inserted within position 145 of EYFP. a conformational change in the insert can modulate the fluorescence efficiency 
of the EYFP reporter. This strategy utilizes only one fluorescent protein, and can give very large responses, including 
changes in pH-sensitive fluorescence at one wavelength rather than a change in ratio of emissions at two 
wavelengths. 

[0070] Accordingly, in one embodiment of a chimeric phosphorylation indicator containing a 
phosphorylatable polypeptide and a fluorescent protein, a phosphoaminoacid binding domain is operatively linked to 
the phosphorylatable polypeptide, the fluorescent protein comprises an N-terminal portion and a C-terminal portion, 
and the phosphorylatable polypeptide and operatively linked phosphoaminoacid binding domain is operatively inserted 
between the N-terminal portion and C-terminal portion of the fluorescent protein. The fluorescent protein can be any 
fluorescent protein, including, a GFP, an RFP, or a fluorescent protein related to a GFP or an RFP, and is exemplified 
by an EYFP in which the phosphorylatable polypeptide and operatively (inked phosphoaminoacid binding domain is 
operatively inserted between amino acid positions 145 and 146 of the EYFP or is substituted for amino acid 145. 

[0071] For many kinases and phosphatases, an artificial substrate, which can be operatively linked to 
PAABDs and a fluorescent protein, may not be a good surrogate for the normal endogenous substrates. As such, a 
fluorescent protein such as GFP, particularly a circularly permuted GFP fcpGFP), can be operatively inserted into the 
endogenous substrate, thus generating a chimeric phosphorylation indicator (see Figure 3). Preferably, the fluorescent 
protein is inserted at a hinge region or turn, such that phosphorylation can significantly change the local conformation, 
which is transmitted to the fluorescent protein, thereby modifying its fluorescence as discussed above. Accordingly, 
in another embodiment of a chimeric phosphorylation indicator containing a phosphorylatable polypeptide and a 
fluorescent protein, the phosphorylatable polypeptide comprises an N-terminal portion and a C-terminal portion, and 
the fluorescent protein is operatively inserted between the N-terminal portion and C-terminal portion of the 
phosphorylatable polypeptide. 

[0072] Unless specifically indicated otherwise, all technical and scientific terms used herein have the 
same meaning as commonly understood by those of ordinary skill in the art to which this invention belongs. In 
addition, any method or material similar or equivalent to a method or material described herein can be used in the 
practice the present invention. For purposes of the present invention, the following terms are defined. 

[0073] The term "nucleic acid molecule" or "polynucleotide" refers to a deoxyribonucleotide or 
ribonucleotide polymer in either single-stranded or double-stranded form, and, unless specifically indicated otherwise, 
encompasses polynucleotides containing known analogs of naturally occurring nucleotides that can function in a 
similar manner as naturally occurring nucleotides. It will be understood that when a nucleic acid molecule is 
represented by a DNA sequence, this also includes RNA molecules having the corresponding RNA sequence in which 
"U" replaces "T." 

[0074] The term "recombinant nucleic acid molecule" refers to a non-naturally occurring nucleic acid 
molecule containing two or more linked polynucleotide sequences. A recombinant nucleic acid molecule can be 
produced by recombination methods, particularly genetic engineering techniques, or can be produced by a chemical 
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synthesis method. A recombinant nucleic acid molecule can encode a fusion protein, for example, a chimeric 
phosphorylation indicator of the invention linked to a polypeptide of interest such as a cell compartmentalization 
domain. The term "recombinant host cell" refers to a cell that contains a recombinant nucleic acid molecule. As such, 
a recombinant host cell can express a polypeptide from a "gene" that is not found within the native (non-recombinant) 
form of the cell. 

[0075] Reference to a polynucleotide "encoding" a polypeptide means that, upon transcription of the 
polynucleotide and translation of the mRNA produced therefrom, a polypeptide is produced. The encoding 
polynucleotide is considered to include both the coding strand, whose nucleotide sequence is identical to an mRNA, as 
well as its complementary strand. It will be recognized that such an encoding polynucleotide is considered to include 
degenerate nucleotide sequences, which encode the same amino acid residues. Nucleotide sequences encoding a 
polypeptide can include polynucleotides containing introns as well as the encoding exons. 

[0076] The term "expression control sequence" refers to a nucleotide sequence that regulates the 
transcription or translation of a polynucleotide or the localization of a polypeptide to which to which it is operatively 
linked. Expression control sequences are "operatively linked" when the expression control sequence controls or 
regulates the transcription and, as appropriate, translation of the nucleotide sequence (i.e., a transcription or 
translation regulatory element, respectively), or localization of an encoded polypeptide to a specific compartment of a 
cell. Thus, an expression control sequence can be a promoter, enhancer, transcription terminator, a start codon 
(AT6), a splicing signal for intron excision and maintenance of the correct reading frame, a STOP codon, a ribosome 
binding site, or a sequence that targets a polypeptide to a particular location, for example, a cell compartmentalization 
signal, which can target a polypeptide to the cytosol, nucleus, plasma membrane, endoplasmic reticulum, 
mitochondrial membrane or matrix, chloroplast membrane or lumen, medial trans-Golgi cisternae, or a lysosome or 
endosome, or is a membrane translocating peptide, which allows a molecule operatively linked thereto to cross a cell 
membrane and enter an intact cell. Cell compartmentalization domains are well known in the art and include, for 
example, a peptide containing amino acid residues 1 to 81 of human type II membrane-anchored protein 
galactosyltransf erase, or amino acid residues 1 to 12 of the presequence of subunit IV of cytochrome c oxidase (see, 
also, Hancock etal., EMBOJ., 10:4033-4039, 1991; Buss etal., Mot. Cell Bio/., 8:3960-3963, 1988; U.S. Patent 
No. 5,776,689, each of which is incorporated herein by reference). 

[0077] The term "operatively linked" or "in operative linkage" also is used herein to indicate that the 
components of a chimeric phosphorylation indicator are joined together such that each component maintains its 
function relevant to phosphorylation detection, or can be induced to express its function. For example, the 
phosphorylatable domain is operatively linked to the PAABD such that, when an amino acid residue of the 
phosphorylatable domain is phosphorylated, the PAA60 can bind to the phosphoaminoacid. Similarly, the donor 
molecule and acceptor molecule are in operative linkage, through the phosphorylatable domain and PAABD, as well as 
any linker molecules, such that, upon excitation of the donor molecule, FRET can occur so as to excite the acceptor, 
which, in response, fluoresces. Methods for operatively linking the components of a chimeric phosphorylation 
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indicator, including the use of linker and spacer peptides and the like, can be determined rationally based, for example, 
on crystaliographic information, can be extrapolated from the methods and compositions disclosed herein, or can be 
determined empirically. 

10078] The term "operatively inserted" is used similarly herein to refer to the introduction of a first 
polypeptide into a second polypeptide, at a position between the N-termtnus and C-terminus of the second 
polypeptide, such that each of the polypeptides maintains its original function or can be induced to express its original 
function. For example, where a phosphoryiatable polypeptide is operatively inserted into a fluorescent protein, the 
phosphorylatable polypeptide maintains its ability to act as a substrate for a phosphatase or kinase, and the 
fluorescent protein maintains its characteristic fluorescence property, although the fluorescence property may not be 
exhibited due, for example, to the phosphorylation state of the phosphorylatable polypeptide. 

[00791 The term "oligomer" refers to a complex formed by the specific interaction of two or more 
polypeptides. A "specific interaction" or "specific association" is one that is relatively stable under specified 
conditions, for example, physiologic conditions. Reference to a "propensity" of proteins to oligomerize indicates that 
the proteins can form dimers, trimers, tetramers, or the like under specified conditions. Generally, fluorescent proteins 
such as GFPs and DsRed have a propensity to oligomerize under physiologic conditions although, as disclosed herein, 
fluorescent proteins also can oligomerize, for example, under pH conditions other than physiologic conditions. 

[0080] The term "probe" refers to a substance that specifically binds to another substance (a 
"target"). Probes include, for example, antibodies, polynucleotides, receptors and their ligands, and generally can be 
labeled so as to provide a means to identify or isolate a molecule to which the probe has specifically bound. The term 
"label" refers to a composition that is detectable with or without the instrumentation, for example, by visual 
inspection, spectroscopy, or a photochemical, biochemical immunochemical or chemical reaction. Useful labels 
include, for example, phosphorus-32, a fluorescent dye, a fluorescent protein, an electron-dense reagent, an enzymes 
(such as is commonly used in an ELISA), a small molecule such as biotin, digoxigenin, or other haptens or peptide for 
which an antiserum or antibody, which can be a monoclonal antibody, is available. It will be recognized that a non- 
oligomerizing fluorescent protein of the invention, which is itself a detectable protein, can nevertheless be labeled so 
as to be detectable by a means other than its own fluorescence, for example, by incorporating a radionuclide label or a 
peptide tag into the protein so as to facilitate, for example, identification of the protein during its expression and 
isolation of the expressed protein, respectively. A label useful for purposes of the present invention generally 
generates a measurable signal such as a radioactive signal, fluorescent light, enzyme activity, and the like, either of 
which can be used, for example, to quantitate the amount of the non-oligomerizing fluorescent protein in a sample. 

[0081] The term "nucleic acid probe" refers to a polynucleotide that binds to a specific nucleotide 
sequence or sub-sequence of a second (target) nucleic acid molecule. A nucleic acid probe generally is a 
polynucleotide that binds to the target nucleic acid molecule through complementary base pairing. It will be 
understood that a nucleic acid probe can specifically bind a target sequence that has less than complete 
complementarity with the probe sequence, and that the specificity of binding will depend, in part, upon the stringency 
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of the hybridization conditions. A nucleic acid probes can be labeled as with a radionuclide, a chromophore, a 
lumiphore, a chromogen, a fluorescent protein, or a small molecule such as biotin, which itself can be bound, for 
example, by a streptavidin complex, thus providing a means to isolate the probe, including a target nucleic acid 
molecule specifically bound by the probe. By assaying for the presence or absence of the probe, one can detect the 
presence or absence of the target sequence or sub-sequence. The term "labeled nucleic acid probe" refers to a nucleic 
acid probe that is bound, either directly or through a linker molecule, and covalently or through a stable non-covalent 
bond such as an ionic, van der Waals or hydrogen bond, to a label such that the presence of the probe can be 
identified by detecting the presence of the label bound to the probe. 

[0082] The term "polypeptide" or "protein" refers to a polymer of two or more amino acid residues. 
The terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a 
corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. The term 
"recombinant protein" refers to a protein that is produced by expression of a nucleotide sequence encoding the amino 
acid sequence of the protein from a recombinant DNA molecule. 

[0083] The term "isolated" or "purified" refers to a material that is substantially or essentially free 
from components that normally accompany the material in its native state in nature. Purity or homogeneity generally 
are determined using analytical chemistry techniques such as polyacryiamide gel electrophoresis, high performance 
liquid chromatography, and the like. A polynucleotide or a polypeptide is considered to be isolated when it is the 
predominant species present in a preparation. Generally, an isolated protein or nucleic acid molecule represents 
greater than 80% of the macromolecular species present in a preparation, often represents greater than 90% of all 
macromolecular species present, usually represents greater than 95%, of the macromolecular species, and, in 
particular, is a polypeptide or polynucleotide that purified to essential homogeneity such that it is the only species 
detected when examined using conventional methods for determining purity of such a molecule. 

[0084] The term "naturally-occurring" is used to refer to a protein, nucleic acid molecule, cell, or 
other material that occurs in nature. For example, a polypeptide or polynucleotide sequence that is present in an 
organism, including in a virus. A naturally occurring material can be in its form as it exists in nature, and can be 
modified by the hand of man such that, for example, is in an isolated form. 

[0085] The term "antibody" refers to a polypeptide substantially encoded by an immunoglobulin gene 
or immunoglobulin genes, or antigen-binding fragments thereof, which specifically bind and recognize an analyte 
(antigen). The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon and mu 
constant region genes, as well as the myriad immunoglobulin variable region genes. Antibodies exist as intact 
immunoglobulins and as well characterized antigen-binding fragments of an antibody, which can be produced by 
digestion with a peptidase or can using recombinant DNA methods. Such antigen-binding fragments of an antibody 
include, for example, Fv, Fab' and F(ab)'2 fragments. The term "antibody," as used herein, includes antibody 
fragments either produced by the modification of whole antibodies or those synthesized de novo using recombinant 
DNA methodologies. As such, the present invention also provides an antibody or antigen-binding fragment thereof 
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that specifically binds a chimeric phosphorylation indicator of the invention. Preferably, an antibody of the invention 
does not specifically bind the individual components that comprise the chimeric indicator, except when the 
components are part of the chimeric phosphorylation indicator. The term "immunoassay" refers to an assay that 
utilizes an antibody to specifically bind an analyte. An immunoassay is characterized by the use of specific binding 
properties of a particular antibody to isolate, target, and/or quantify the analyte. 

[0086] The term "identical," when used in reference to two or more polynucleotide sequences or two 
or more polypeptide sequences, refers to the residues in the sequences that are the same when aligned for maximum 
correspondence. When percentage of sequence identity is used in reference to a polypeptide, it is recognized that one 
or more residue positions that are not otherwise identical can differ by a conservative amino acid substitution, in 
which a first amino acid residue is substituted for another amino acid residue having similar chemical properties such 
as a similar charge or hydrophobic or hydrophilic character and, therefore, does not change the functional properties 
of the polypeptide. Where polypeptide sequences differ in conservative substitutions, the percent sequence identity 
can be adjusted upwards to correct for the conservative nature of the substitution. Such an adjustment can be made 
using well known methods, for example, scoring a conservative substitution as a partial rather than a full mismatch, 
thereby increasing the percentage sequence identity. Thus, for example, where an identical amino acid is given a 
score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score 
between zero and 1 . The scoring of conservative substitutions can be calculated using any well known algorithm 
(see, for example, Meyers and Miller, Comp. Appl. Biol Sci. 4:1 M 7, 1988; Smith and Waterman, Adv. AppL Math. 
2:482, 1981; Needleman and Wunsch, J. Mol. Biol 48:443, 1970; Pearson and Lipman, Proc. Natl. Acad. Sci., USA 
85:2444 (1988); Higgins and Sharp, Gene 73:237-244, 1988; Higgins and Sharp, CAB/OS 5:151-153; 1989; Corpet ., 
Nucl Acids ties., 16:10881-10890, 1988; Huang, etal, Comp. Appl Biol ScL f 8:155-165, 1992; Pearson etaL 
Meth. Mol Biol, 24:307-331, 1994). Alignment also can be performed by simple visual inspection and manual 
alignment of sequences. 

[0087] The term "conservatively modified variation," when used in reference to a particular 
polynucleotide sequence, refers to different polynucleotide sequences that encode identical or essentially identical 
amino acid sequences, or where the polynucleotide does not encode an amino acid sequence, to essentially identical 
sequences. Because of the degeneracy of the genetic code, a large number of functionally identical polynucleotides 
encode any given polypeptide. For instance, the codons CGU, CGC, CGA, CGG, AGA, and AGG all encode the amino 
acid arginine. Thus, at every position where an arginine is specified by a codon, the codon can be altered to any of 
the corresponding codons described without altering the encoded polypeptide. Such nucleotide sequence variations 
are "silent variations," which can he considered a species of "conservatively modified variations." As such, it will be 
recognized that each polynucleotide sequence disclosed herein as encoding a non-oligomerizing fluorescent protein also 
describes every possible silent variation. It will also be recognized that each codon in a polynucleotide, except AUG, 
which is ordinarily the only codon for methionine, and UUG, which is ordinarily the only codon for tryptophan, can be 
modified to yield a functionally identical molecule by standard techniques. Accordingly, each silent variation of a 
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polynucleotide that does not change the sequence of the encoded polypeptide is implicitly described herein. 
Furthermore, it will be recognized that individual substitutions, deletions or additions that alter, add or delete a single 
amino acid or a small percentage of amino acids (typically less than 5%, and generally less than 1%) in an encoded 
sequence can be considered conservatively modified variations, provided alteration results in the substitution of an 
amino acid with a chemically similar amino acid. Conservative amino acid substitutions providing functionally similar 
amino acids are well known in the art, including the following six groups, each of which contains amino acids that are 
considered conservative substitutes for each another: 

1) Alanine (Ala, A), Serine (Ser, S) r Threonine (Thr, T); 

2) Aspartic acid (Asp, D), Glutamic acid {Glu, E); 

3) Asparagine (Asn, N), Glutamine (Gin, Q); 

4) Arginine (Arg, R), Lysine (Lys, K); 

5) Isoieucine (lie, I), Leucine (Leu, L), Methionine (Met, M), Valine (Val, V); and 
6] Phenylalanine (Phe, F), Tyrosine (Tyr, Y), Tryptophan {T rp, W). 

[00881 Two or more amino acid sequences or two or more nucleotide sequences are considered to be 
"substantially identical" or "substantially similar" if the amino acid sequences or the nucleotide sequences share at 
least 80% sequence identity with each other, or with a reference sequence over a given comparison window. Thus, 
substantially similar sequences include those having, for example, at least 85% sequence identity, at least 90% 
sequence identity, at least 95% sequence identity, or at least 99% sequence identity. 

[0089] A subject nucleotide sequence is considered "substantially complementary" to a reference 
nucleotide sequence if the complement of the subject nucleotide sequence is substantially identical to the reference 
nucleotide sequence. The term "stringent conditions" refers to a temperature and ionic conditions used in a nucleic 
acid hybridization reaction. Stringent conditions are sequence dependent and are different under different 
environmental parameters. Generally, stringent conditions are selected to be about 5°C to 20°C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature, 
under defined ionic strength and pH, at which 50% of the target sequence hybridizes to a perfectly matched probe. 

[0090] The term "allelic variants" refers to polymorphic forms of a gene at a particular genetic locus, 
as well as cDNAs derived from mRMA transcripts of the genes, and the polypeptides encoded by them. The term 
"preferred mammalian codon" refers to the subset of codons from among the set of codons encoding an amino acid 
that are most frequently used in proteins expressed in mammalian cells as chosen from the following list: Gly (GGC, 
GGG); Glu (GAG); Asp (GAC); Val (GUG, GUC); Ala (GCC, GCU); Ser (AGC, UCC); Lys (AAG); Asn (AAC); Met (AUG); He 
(AUC); Thr (ACC); Trp (UGG); Cys (UGC); Tyr (UAU, UAC); Leu (CUG); Phe (UUC); Arg (CGC, AGG, AGA); Gin (CAG); His 
(CAC); and Pro (CCC). 

[0091] Fluorescent molecules are useful in fluorescence resonance energy transfer, FRET, which 
involves a donor molecule and an acceptor molecule. To optimize the efficiency and detectability of FRET between a 
donor and acceptor molecule, several factors need to be balanced. The emission spectrum of the donor should overlap 
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as much as possible with the excitation spectrum of the acceptor to maximize the overlap integral. Also, the quantum 
yield of the donor moiety and the extinction coefficient of the acceptor should be as high as possible to maximize RQ, 
which represents the distance at which energy transfer efficiency is 50%. However, the excitation spectra of the . 
donor and acceptor should overlap as little as possible so that a wavelength region can be found at which the donor 
can be excited efficiently without directly exciting the acceptor because fluorescence arising from direct excitation of 
the acceptor can be difficult to distinguish from fluorescence arising from FRET. Similarly, the emission spectra of 
the donor and acceptor should overlap as little as possible so that the two emissions can be clearly distinguished. 
High fluorescence quantum yield of the acceptor moiety is desirable if the emission from the acceptor is to be 
measured either as the sole readout or as part of an emission ratio. One factor to be considered in choosing the donor 
and acceptor pair is the efficiency of fluorescence resonance energy transfer between them. Preferably, the 
efficiency of FRET between the donor and acceptor is at least 10%, more preferably at least 50% and even more 
preferably at least 80%. 

10092] The term "fluorescent property" refers to the molar extinction coefficient at an appropriate 
excitation wavelength, the fluorescence quantum efficiency, the shape of the excitation spectrum or emission 
spectrum, the excitation wavelength maximum and emission wavelength maximum, the ratio of excitation amplitudes 
at two different wavelengths, the ratio of emission amplitudes at two different wavelengths, the excited state 
lifetime, or the fluorescence anisotropy. A measurable difference in any one of these properties between wild type 
Aequorea GFP and a spectral variant, or a mutant thereof, is useful. A measurable difference can be determined by 
determining the amount of any quantitative fluorescent property, e.g. r the amount of fluorescence at a particular 
wavelength, or the integral of fluorescence over the emission spectrum. Determining ratios of excitation amplitude or 
emission amplitude at two different wavelengths ("excitation amplitude ratioing" and "emission amplitude ratioing", 
respectively) are particularly advantageous because the ratioing process provides an internal reference and cancels 
out variations in the absolute brightness of the excitation source, the sensitivity of the detector, and light scattering 
or quenching by the sample. 

10093] As used herein, the term "fluorescent protein" refers to any protein that can fluoresce when 
excited with an appropriate electromagnetic radiation, except that chemically tagged proteins, wherein the 
fluorescence is due to the chemical tag, and polypeptides that fluoresce only due to the presence of certain amino 
acids such as tryptophan or tyrosine, which fluoresce when exposed to ultraviolet light, are not considered fluorescent 
proteins for purposes of the present invention. In general, a fluorescent protein useful for preparing a composition of 
the invention or for use in a method of the invention is a protein that derives its fluorescence from autocatalytically 
forming a chromophore. A fluorescent protein can contain amino acid sequences that are naturally occurring or that 
have been engineered {i.e., variants or mutants). When used in reference to a fluorescent protein, the term "mutant" 
or "variant" refers to a protein that is different from a reference protein. For example, a spectral variant of Aequorea 
GFP can be derived from the naturally occurring GFP by engineering mutations such as amino acid substitutions into 
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the reference GFP protein. For example ECFP is a spectral variant of GFP that contains substitutions with respect to 
GFP (compare SEQ ID NQS: 2 and 6). 

10094] Many cnidarians use green fluorescent proteins as energy transfer acceptors in 
bioluminescence. The term "green fluorescent protein" is used broadly herein to refer to a protein that fluoresces 
green light for example, Aequorea GFP (SEQ ID N0:2). GFPs have been isolated from the Pacific Northwest jellyfish, 
Aequorea victoria, the sea pansy, Renilla reniformis, and Phialidium gregarium (Ward et aL, Photochem. PhotobioL, 
35:803-808, 1982; Levineef aL, Comp. Biochem. PhysioL, 72B:77-85, 1982, each of which is incorporated herein by 
reference). Similarly, reference is made herein to "red fluorescent proteins", which fluoresce red, "cyan fluorescent 
proteins," which fluoresce cyan, and the like. RFPs, for example, have been isolated from the coral, Discosoma (Mate 
etaL, supra, 1999). 

[0095] A variety of Aequorea GFP-related fluorescent proteins having useful excitation and emission 
spectra have been engineered by modifying the amino acid sequence of a naturally occurring GFP from A victoria (see 
Prasher etaL, Gene 1 1 1:229-233, 1992; MtnetaL, Proc. Natl. Acad. ScL, USA 91:12501-12504, 1994; U.S. Serial 
No. 08/337,915, filed Nov. 10, 1994; International application PCT/US95J14692, each of which is incorporated 
herein by reference). As used herein, reference to a "related fluorescent protein" refers to a fluorescent protein that 
has a substantially identical amino acid sequence when compared to a reference fluorescent protein. In general, a 
related fluorescent protein, when compared to the reference fluorescent protein sequence, has a contiguous sequence 
of at least about 150 amino acids that shares at least about 85% sequence identity with the reference fluorescent 
protein, and particularly has a contiguous sequence of at least about 200 amino acids that shares at least about 95% 
sequence identity with the reference fluorescent protein. Thus, reference is made herein to an "Aequorea-islaXei 
fluorescent protein" or to a "GFP-related fluorescent protein," which is exemplified by the various spectral variants 
and GFP mutants that have amino acid sequences that are substantially identical to A. victoria GFP (SEQ ID N0:2), to 
a "Discosoma-iefotBi fluorescent protein" or a "DsRed-related fluorescent related protein," which is exemplified by 
the various mutants that have amino acid sequences substantially identical to that of DsRed (SEQ ID N0:12), and the 
like, for example, a /fe/7/7/d-related fluorescent protein or a Pfiialidium-re\ateA fluorescent protein. 

[0096] The term "mutant" or "variant" also is used herein in reference to a fluorescent protein to 
refer to a fluorescent protein that contains a mutation with respect to a corresponding wild type fluorescent protein. 
In addition, reference is made herein the a "spectral variant" or "spectral mutant" of a fluorescent protein to indicate 
a mutant fluorescent protein that has a different fluorescence characteristic with respect to the corresponding wild 
type fluorescent protein. For example, CFP, YFP, ECFP (SEQ ID N0:6), EYFP-V68L/Q69K (SEQ ID N0:10), and the like 
are GFP spectral variants. 

10097] Aequorea GFP-related fluorescent proteins include, for example, wild type (native) Aequorea 
victoria GFP (Prasher etaL, supra, 1992; see, also, SEQ ID N0:2), allelic variants of SEQ ID M0:2, for example, a 
variant having a Q80R substitution (Chalfie etaL, Science 263:802-805, 1994, which is incorporated herein by 
reference); and spectral variants of GFP such as CFP, YFP, and enhanced and otherwise modified forms thereof (U.S. 
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Pat. Nos. 6,150,176; 6,124,128; 6,077,707; 6,066,476; 5,998,204; and 5,777,079, each of which is incorporated 
herein by reference), including GFP-related fluorescent proteins having one or more folding mutations, and fragments 
of the proteins that are fluorescent, for example, an A. victoria GFP from which the two N-terminal amino acid 
residues have been removed. Several of these fluorescent proteins contain different aromatic amino acids within the 
central chromophore and fluoresce at a distinctly shorter wavelength than the wild type GFP species. For example, 
the engineered GFP proteins designated P4 and P4-3 contain, in addition to other mutations, the substitution Y66H; 
and the engineered GFP proteins designated W2 and W7 contain, in addition to other mutations, Y66W. 

[0098] Folding mutations in Aequorea GFP-related fluorescent proteins improve the ability of the 
fluorescent proteins to fold at higher temperatures, and to be more fluorescent when expressed in mammalian cells, 
but have little or no effect on the peak wavelengths of excitation and emission. If desired, these mutations can be 
combined with additional mutations that influence the spectral properties of GFP to produce proteins with altered 
spectral and folding properties, and, particularly, with mutations that reduce or eliminate the propensity of the 
fluorescent proteins to oligomerize. Folding mutations, with respect to SEQ ID N0:2, include the substitutions F64L, 
V68L, S72A, T44A, F99S, Y145F, N146I, W1153T, M153A, V163A, I167T, S175G, S2Q5T, and N212K. 

[0099] The term "loop domain" refers to an amino acid sequence of an Aequorea-rdaied fluorescent 
protein that connects the amino acids involved in the secondary structure of the eleven strands of the p-barrel or the 
central a-helix (residues 56-72). The term "fluorescent protein moiety," when used in reference to a fluorescent 
protein, refers to a portion of the amino acid sequence of the fluorescent protein that, when the amino acid sequence 
of the fluorescent protein substrate is optimally aligned with the amino acid sequence of a naturally occurring 
fluorescent protein, lies between the amino terminal and carboxy terminal amino acids, inclusive, of the amino acid 
sequence of the naturally occurring fluorescent protein, and comprises a chromophore, which fluoresces upon' 
exposure to an appropriate wavelength of light. 

101 00] Fluorescent proteins fused to target proteins can be prepared using recombinant DNA 
methods, and used as markers to identify the location and amount of the target protein produced. Accordingly, the 
present invention provides fusion proteins comprising a non-oligomerizing fluorescent protein moiety and a polypeptide 
of interest. The polypeptide of interest can be of any length, for example, about 1 5 amino acid residues, about 
50 residues, about 150 residues, or up to about 1000 amino acid residues or more, provided that the fluorescent 
protein component of the fusion protein can fluoresce or can be induced to fluoresce when exposed to electromagnetic 
radiation of the appropriate wavelength. The polypeptide of interest can be, for example, a peptide tag such as a 
polyhistidine sequence, a c-myc epitope, a FLAG epitope, and the like; can be an enzyme, which can be used to effect 
a function in a cell expressing a fusion protein comprising the enzyme or to identify a cell containing the fusion 
protein; can be a protein to be examined for an ability to interact with one or more other proteins in a cell, or any 
other protein as disclosed herein or otherwise desired. 

[0101] Fluorescent characteristics of Aequorea GFP-related fluorescent proteins depend, in part, on 
the electronic environment of the chromophore. In general, amino acids that are within about 0.5 nm of the 
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chromophore influence the electronic environment of the chromophore. Therefore, substitution of such amino acids 
can produce fluorescent proteins with altered fluorescent characteristics. In the excited state, electron density tends 
to shift from the phenolate towards the carbonyl end of the chromophore. Therefore, placement of increasing positive 
charge near the carbonyl end of the chromophore tends to decrease the energy of the excited state and cause a red- 
shift in the absorbance and emission wavelength maximum of the protein. Decreasing a positive charge near the 
carbonyl end of the chromophore tends to have the opposite effect, causing a blue-shift in the protein's wavelengths. 
Similarly, mutations have been introduced into DsRed to produce mutants having altered fluorescence characteristics 
(see Example 2). 

[0102] Amino acids with charged (ionized D, E, K, and R), dipolar (H, N, Q, S, T, and uncharged D, E 
and K), and polarizabfe side groups (e.g., C, F, H, M, W and Y) are useful for altering the ability of fluorescent proteins 
to oligomerize, especially when they substitute an amino acid with an uncharged, nonpolar or non-polarizable side 
chain (see Examples 1 and 3). As disclosed herein, substitution of hydrophobic residues that were predicted to be 
involved in self-association of GFP with positively-charged residues reduced or eliminated dimerization. However, 
other non-conservative amino acid substitutions also can be introduced similarly or at neighboring positions in the 
interacting regions of the proteins, thus disrupting the localized structure qf the protein, provided the substitutions do 
not undesirably affect the fluorescent properties of the proteins. 

[0103] The present invention also relates to polynucleotide encoding a chimeric phosphorylation 
indicator of the invention. In one embodiment, the polynucleotide encodes a chimeric phosphorylation indicator, which 
contains, in operative linkage, a donor molecule, a phosphorylatable domain, a phosphoaminoacid binding domain, and 
an acceptor molecule, wherein the phosphoaminoacid binding domain specifically binds to a phosphoaminoacid when 
present in the phosphorylatable domain, the donor molecule and the acceptor molecule exhibit a detectable resonance 
energy transfer when the donor is excited, and the phosphorylatable domain and phosphoaminoacid binding domain do 
not substantially emit light to excite the acceptor. In another embodiment, the polynucleotide encodes a chimeric 
phosphorylation indicator containing a phosphorylatable polypeptide and a fluorescent protein, wherein the 
phosphorylatable polypeptide includes an N-terminal portion and a C-terminal portion, and the fluorescent protein is 
operatively inserted between the N-terminal portion and C-terminal portion of the phosphorylatable polypeptide. In 
still another embodiment, the polynucleotide encodes a chimeric phosphorylation indicator containing a 
phosphoaminoacid binding domain operatively linked to a phosphorylatable polypeptide and a fluorescent protein, 
wherein the fluorescent protein includes an N-terminal portion and a C-terminal portion, and wherein the 
phosphorylatable polypeptide and operatively linked phosphoaminoacid binding domain is operatively inserted between 
the N-terminal portion and C-terminal portion of the fluorescent protein. 

[01 04] Also provided is a vector containing a polynucleotide of the invention, including an expression 
vector, as well as host cells that contain a polynucleotide of the invention or a vector containing such a 
polynucleotide. In one embodiment, a polynucleotide of the invention is operatively linked to an expression control 
sequence, for example, a transcription regulatory element, a translation regulatory element, or a combination thereof. 
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In another embodiment the polynucleotide is operatively linked to a nucleotide sequence encoding a membrane 
translocating domain or a cell compartmentalization domain. 

(01 05] A vector generally contains elements required for replication in a prokaryotic or eukaryotic 
host system or both, as desired. Such vectors, which include plasmid vectors and viral vectors such as bacteriophage, 
baculovirus, retrovirus, lentivirus, adenovirus, vaccinia virus, semliki forest virus and adeno-associated virus vectors, 
are well known and can be purchased from a commercial source (Promega, Madison Wl; Stratagene, La Jolla CA; 
GIBCO/BRL, Gaithersburg MD) or can be constructed by one skilled in the art (see, for example, Meth. EnzymoL, Vol. 
185, Goeddel, ed. (Academic Press, Inc., 1990); Jolly, Cane. GeneTher. 1:51-64, 1994; Flotte,*/, Bioenerg. Biomemb. 
25:37-42, 1993; Kirshenbaum etai.,1 Clin. Invest, 92:381-387, 1993; each of which is incorporated herein by 
reference). 

[0106] A vector for containing a polynucleotide encoding a chimeric phosphorylation indicator can be 
a cloning vector or an expression vector, and can be a plasmid vector, viral vector, and the like. Generally, the vector 
contains a selectable marker independent of that encoded by a polynucleotide of the invention, and further can 
contain transcription or translation regulatory elements, including a promoter sequence, which can provide tissue 
specific expression of a polynucleotide operatively linked thereto, which can, but need not, be the polynucleotide 
encoding the chimeric phosphorylation indicator, thus providing a means to select a particular cell type from among a 
mixed population of cells containing the introduced vector and recombinant nucleic acid molecule contained therein. 

(0107] Where the vector is a viral vector, it can be selected based on its ability to infect one or few 
specific cell types with relatively high efficiency. For example, the viral vector also can be derived from a virus that 
infects particular cells of an organism of interest, for example, vertebrate host cells such as mammalian host cells. 
Viral vectors have been developed for use in particular host systems, particularly mammalian systems and include, for 
example, retroviral vectors, other lentivirus vectors such as those based on the human immunodeficiency virus (HIV), 
adenovirus vectors, adeno-associated virus vectors, herpesvirus vectors, vaccinia virus vectors, and the like (see Miller 
and Rosman, BioTechniques 7:980-990, 1992; Anderson etal., Nature 392:25-30 Suppl., 1 998; Verma and Somia, 
Nature 389:239-242, 1997; Wilson, New Engl. J. Med. 334:1 185-1 187 (1996), each of which is incorporated herein 
by reference). 

[0108] Recombinant production of a chimeric phosphorylation indicator involves expressing a 
polypeptide encoded by a polynucleotide. The sequence of the polynucleotide can be confirmed using routine methods, 
including, for example, PCR methods (see, for example, U.S. Pat. No. 4,683,195; Mullis etaL, Cold Spring Harbor 
Symp. Quant. Biol. 51:263, 1987; Erlich, ed., "PCR Technology" (Stockton Press, NY, 1989)). The construction of 
expression vectors and the expression of a polynucleotide in transfected cells involves the use of molecular cloning 
techniques also well known in the art (see Sambrook etaL, In "Molecular Cloning: A Laboratory Manual" (Cold Spring 
Harbor Laboratory Press 1989); "Current Protocols in Molecular Biology" (eds., Ausubel etal:, Greene Publishing 
Associates, Inc., and John Wiley & Sons, Inc. 1990 and supplements). Expression vectors contain expression control 
sequences operatively linked to a polynucleotide sequence of interest, for example, that encoding a non-oligomerizing 
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fluorescent protein, as indicated above.. The expression vector can be adapted for function in prokaryotes or 
eukaryotes by inclusion of appropriate promoters, replication sequences, markers, and the like. An expression vector 
can be transf ected into a recombinant host cell for expression of a chimeric phosphorylation indicator, and host cells 
can be selected, for example, for high levels of expression in order to obtain a large amount of isolated protein. A host 
cell can be maintained in cell culture, or can be a cell in vivo in an organism. 

[0109] An expressed chimeric phosphorylation indicator can be operatively linked to a polypeptide of 
interest, for example, a peptide tag, which can be used to facilitate isolation of the indicator. For example, a 
polyhistidine tag containing, for example, six histidine residues, can be incorporated at the N-terminus or C-terminus of 
the chimeric phosphorylation indicator, which then can be isolated in a single step using nickekhelate 
chromatography (see Example 1). Additional peptide tags, including a c-myc peptide, a FLAG epitope, or any ligand (or 
cognate receptor), including any peptide epitope (or antibody, or antigen binding fragment thereof, that specifically 
binds the epitope are well known in the art and similarly can be used, (see, for example, Hopp et aL, Biotechnology 
6:1204, 1988; U.S. Pat. No. 5,01 1,912, each of which is incorporated herein by reference). 

[0110] Kits also are provided to facilitate and, where desired, standardize the compositions of the 
invention and the uses thereof. A kit can contain one or more compositions of the invention, for example, one or a 
plurality of chimeric phosphorylation indicators, or one or a plurality of polynucleotides that encode the indicators. In 
addition, the kit can contain instructions for using the components of the kit, particularly the compositions of the 
invention that are contained in the kit. 

[0111] Such kits can be particularly useful where they provide a plurality of different chimeric 
t phosphorylation indicators because the artisan can conveniently select one or more indicators having the properties 
desired for a particular application. Similarly, a kit containing a plurality of polynucleotides encoding different 
chimeric phosphorylation indicators provides numerous advantages. For example, the polynucleotides can be 
engineered to contain convenient restriction endonuclease or recombinase recognition sites, thus facilitating operative 
linkage of the polynucleotide to a regulatory element or to a polynucleotide encoding a phosphorylatable domain or 
PAABD of interest. , 

101 1 2] The present invention further relates to a method for detecting a kinase or phosphatase in a 
sample. In one embodiment, a method of the invention is performed, for example, contacting the sample with a 
chimeric phosphorylatable indicator, which contains, in operative linkage, a donor molecule, a phosphorylatable 
domain, a phosphoaminoacid binding domain, and an acceptor molecule, wherein the phosphoaminoacid binding domain 
specifically binds to a phosphoaminoacid when present in the phosphorylatable domain, the donor molecule and the 
acceptor molecule exhibit a detectable resonance energy transfer when the donor is excited, and the phosphorylatable 
domain and phosphoaminoacid binding domain do not substantially emit light to excite the acceptor; exciting the donor 
molecule; and determining a fluorescence or luminescence property in the sample, wherein the presence of a kinase or 
phosphatase in the sample results in a change in the degree of FRET or LRET, thereby detecting the kinase or 
phosphatase in the sample. The change in the degree of FRET or LRET can be an increased amount of FRET or LRET, 
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or can be a decreased amount of FRET or LRET, and the change can be indicative of the presence of a kinase in the 
sample, or, where the phosphorylatable domain is phosphorylated prior to contacting the sample with a chimeric 
phosphorylatable indicator, can be indicative of a phosphatase in the sample. 

[0113] Fluorescence in a sample generally is measured using a fluonmeter, wherein excitation 
radiation from an excitation source having a first wavelength, passes through excitation optics, which cause the 
excitation radiation to excite the sample. In response, a non-oligomerizing fluorescent protein in the sample emits 
radiation having a wavelength that is different from the excitation wavelength. Collection optics then collect the 
emission from the sample. The device can include a temperature controller to maintain the sample at a specific 
temperature while it is being scanned, and can have a multi-axis translation stage, which moves a microliter plate 
holding a plurality of samples in order to position different wells to be exposed. The multi-axis translation stage, 
temperature controller, auto-focusing feature, and electronics associated with imaging and data collection can be 
managed by an appropriately programmed digital computer, which also can transform the data collected during the 
assay into another format for presentation. This process can be miniaturized and automated to enable screening 
many thousands of compounds in a high throughput format. These and other methods of performing assays on 
fluorescent materials are well known in the art (see, for example, Lakowicz, "Principles of Fluorescence 
Spectroscopy" (Plenum Press 1983); Herman, "Resonance energy transfer microscopy" In "Fluorescence Microscopy 
of Living Cells in Culture" Part B, IWeth. Cell Biol. 30:219-243 (ed. Taylor and Wang; Academic Press 1989); Turro, 
"Modern Molecular Photochemistry" (Benjamin/ Cummings Pub!. Co., Inc. 1978), pp. 296-361, each of which is 
incorporated herein by reference). 

I01 14] In another embodiment, a method for detecting a kinase or phosphatase in a sample is 
performed by contacting the sample with a chimeric phosphorylatable indicator containing a phosphorylatable 
polypeptide and a fluorescent protein, determining a fluorescence property in the sample, wherein the presence of 
kinase or phosphatase activity in the sample results in a change in the fluorescence property as compared to the 
fluorescent property in the absence of a kinase or phosphatase activity, thereby detecting the kinase or phosphatase 
in the sample. The chimeric phosphorylation indicator can containing a phosphorylatable polypeptide that includes an 
N-terminal portion and a C-terminal portion, such that the fluorescent protein is operatively inserted between the 
N-terminal portion and C-terminal portion of the phosphorylatable polypeptide; or the chimeric phosphorylation 
indicator can contain a phosphoaminoacid binding domain operatively linked to a phosphorylatable polypeptide, which 
is operatively inserted between an W-terminal portion and a C-terminal portion of the fluorescent protein. 

[01 15] The sample to be examined for kinase activity can be any sample, including, for example, a 
sample containing a synthetic product to be examined for kinase or phosphatase activity. In one embodiment, the 
sample is a biological sample, which can be cell, tissue or organ sample, or an extract of such a sample. In another 
embodiment, the method is performed on an intact cell, which can be in cell culture or can be in a tissue sample. For 
such a method, the chimeric phosphorylatable indicator can contain a targeting sequence such as a cell 
compartmentalization domain that can target the chimeric phosphorylatable indicator to cytosol, endoplasmic 
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reticulum, mitochondrial matrix, chloroplast lumen, medial trans-Golgi cisternae, a lumen of a lysosome, or a lumen of 
an endosome. A membrane translocating domain can be a particularly useful cell compartmentalization domain is a 
membrane translocating domain, which can facilitate translocation of the chimeric phosphorylation indicator into an 
intact cell. 

[0116] The phosphorylatable polypeptide in a chimeric phosphorylation indicator comprising a 
fluorescent protein and a phosphorylatable polypeptide can be unphosphorylated or phosphorylated at an amino acid 
position specific for a kinase or a phosphatase, depending on whether the method is for detecting a kinase or 
phosphatase. A method of the invention also can be used to detect an absence of kinase or phosphatase activity in 
the sample is due to the presence of a kinase inhibitor or phosphatase inhibitor. As such, the method is useful for 
identifying a kinase inhibitor or a phosphatase inhibitor. 

[0117] The following examples are intended to illustrate, but not limit, the present invention. 

EXAMPLE 1 

Preparation and Characterization of a Chimeric Reporter 
Protein for a Serine/Threonine Protein Kinase 

[0118] This example provides a method for preparing a chimeric protein kinase A (PKA; cAMP» 
dependent protein kinase) reporter protein, and demonstrates that such a chimeric reporter molecule can detect 
serine/threonine kinase activity. 

Plasmid Construction 

[0119] The PKA chimeric reporter protein was constructed by fusing the enhanced cyan fluorescence 
protein (1-227; ECFP; SEQ ID N0:6; K26R/F64L/S65T/Y66 W/W 7 46I/M 1 53T/V 1 63 A/ W 1 B4H), a truncated version of 
14-3-3t, a modified kemptide and citrine, which is an improved yellow fluorescence protein having a sequence as set 
forth in SEQ ID NO: 10, except containing a Q69M mutation. 14-3-3t (1-232) was amplified using the cDNA of 14-3- 
3t (see GenBank Accession No. D87662, which is incorporated herein by reference) in pcDNA3 vector as the 
template. 

[0 1 20] For PCR, the forward primer had the sequence 
B'-GGGCATGCATATGGAGAAGACTGAGCTGATCCAG-S' (SEQ ID N0:26), and incorporates a Sph 1 site. The reverse 
primer had the sequence 

5 ? -CGCGGAGCTCGCTGCCGCCGGTGCCGCCCAGGCTGGCGCGACGGAGGCTGCCGCCGGTGCCGCCTGCAGAGTCTGAT 
GTCCAAAGTGTTAGG-3' (SEQ ID N0:27), which introduces a short linker peptide (AGGTGGS; SEQ ID NQ:19), the 
kemptide sequence (LRRASLG; SEQ ID N0:32), a second short linker peptide (GTGGSEL; SEQ ID W0:21) and a Sac I 
site. 
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[0121] PCR was performed using 50 ng of template, 300 nWl of each primer, 500 nWI of each dNTP, 
2.5 Unit of Taq polymerase (Gibco) in 1X PCR reaction buffer (Boehringer Mannheim) with nanopure water (50 pi 
total volume). PCR was performed as follows: 95°C, 5min; 2 cycles of (95°C, 1 min; 40°C, 1 min; 72 D C, 2.5 min); 5 
cycles of |95°C, 1 min; 43°C, 1 min; 72°C, 2.5 min); 5 cycles of (95°C, 1 min; 45°C, 1 min; 72°C, 2.5 min); 15 cycles 
of (95°C, 1 min; 52°C, 1 min; 72°C, 2.5min; 72°C, 7 min), then hold at 25°C. 

[0122] The amplification product was purified using the Qiagen gel purification kit, then digested 
with Sph I and Sac I overnight. The digested mixture was purified using Qiagen PCR purification kit, and the purified 
fragment was ligated into Sph I/Sac l-digested pRSETB (Invitrogen) containing the cDMA sequence for ECFP and 
citrine (from Yellow cameleon 3.3). The construct was within the Bam Hl/Eco Rl sites of pRSETB, and is behind a 
polyhistidine tag for bacterial expression. The resulting plasmid was amplified, sequenced and mutagenized using the 
QuickChangeTM site-directed mutagenesis kit (Stratagene) to introduce one amino acid change in the kemptide 
sequence, generating the plasmid C4kY2.1 -pRSETB. 

101 23] Primers for mutagenesis had the following sequence: 
5'-CGTCGCGCCAGCCTGCCAGGCACCGGCGGCAGC-3' (SEQ ID N0:28), and 

5'-GCTGCCGCCGGTGGCTGGCAGGCTGGCGCGACG-3' (SEQ ID N0:29). The S475A mutant was generated similarly 
using the following primers: GCCTCCGTCGCGCCGCACTGCCAGGCACCGGC (SEQ ID N0:30); and 
GCCGGTGCCTGGCAGTGCGGCGCGACGGAGGC (SEQ ID N0:31). 

[0124] For mammalian expression, both C4kY2.1 and C4kY2.1(S475A) were cloned into the vector 
pcDNA3 behind a Kozak sequence for mammalian expression. 

Protein Expression, Phosphorylation and in vitro Spectroscopy 

[01 25] C4kY2.1 -pRSETB and C4kY2.1 (S475A)-pRSETB each were transformed into £ coli strain 
BL2KDE3). A single colony was picked and grown in 100 to 500 ml LB medium containing 0.1 mg/ml ampicillin at 
37°C to an optical density of 0.4-0.8 at 600 nm, then induced with 0.1 mg/ml isopropyl thiogalactoside (IPTG) at 
25°C for 12 to 24 hr. Cells were harvested by centrifugation, then the bacterial pellet was suspended in 4 to 1 0 ml 
B-PERTM II Bacterial Extraction Reagent (Pierce) and lysed by gentle shaking at 25°C for 15 min in the presence of 
protease inhibitors (CompleteTM EDTA-free Protease Inhibitor tablet (Roche), 1 mM phenylmethylsulfonyl fluoride). 
The lysate was clarified by centrifugation at 12,000g for 30 min at 4°C. 

[01 26] Binding of the Hise tag to Ni-NTA agarose (Qiagen) was carried out in a batch mode. The 
supernatant was filtered through a 0.22 p,M syringe filter, then transferred to a new tube, to which 0.3 to 1 ml of the 
50% (v/v) Ni-NTA slurry was added. The suspension was mixed gently on a rotary shaker at 4°C for 1 hr. The lysate- 
Ni-NTA mixture was loaded into a column, which was washed with 10 volumes of TNS300 buffer (Tris-HCl, pH7.4, 
300mM NaCI) and 10 volumes of TNS300 containing 10 mM imidazole. The chimeric protein was eluted with 1 to 
3ml of elution buffer (100mM imidazole in TNS300) and dialyzed in TNS300 buffer at 4°C for 12 to 24 hr. When 
necessary, the protein was concentrated using a YM-30 Microcon or Centricon concentrator (Fisher). 
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[01271 Chimeric proteins were phosphorylated with the catalytic subunit of PKA (New England 
Biolabs; 2.5U/juL) in PKA kinase reaction buffer (5QmM Tris-HCl, pH7.5, 10mM MgCk, 1mg/ml BSA) in the presence 
of an optional amount of adenosine triphosphate, ATP (200 jiM to 1 mM) at 25°C. Fluorescence was monitored using 
an excitation wavelength of 434 nm before and after adding ATP to the rest of the components. Experiments were 
performed in a Perkin-Elmer Spectrofluorometer IS50B. Emission spectra (excitation 380nm, emission 430 to 650nm) 
were collected and 536/476 emission ratios calculated at each time point. 

[01 28] For/3? vitro kinase assay, different constructs were incubated with 7.5 unit of catalytic 
subunit of PKA and 6.5 nM (1.2 \lC\) (y- 32 P)-ATP (6000Ci/mmol, New England Nuclear) in PKA kinase reaction buffer, 
in a total volume of 30 jxL at 25°C for 30 min to 12 hr. Ten jd. of the reaction mixture was spotted onto a 
phosphocellulose disk, immersed in 0.5% H3PO4 and washed with the PKA kinase reaction buffer three times, for 10 
min each. Transfer of 32 P was measured by standard scintillation counting. 

[0129] For testing the specificity of the reporter, fluorescence change and transfer of 32P upon 
incubation with CaMKII (New England Biolabs) and PKC II were followed similarly. The reaction conditions for 
CaMKII was SU/pl CaMKII, ImM ATP or 6.5 nM (1.2 jnCi} (y- 32 P)ATP (6000Ci/mmol, NEN), 2 mM CaCb, 2.4 \M 
calmodulin in CaMKII buffer (20 mM Tris-HCl, pH 7.5, 10 mM MgCk, 0.5 mM dithiothreitol and 0.1 mM Na2EDTA) at 
25°C for 30 min to 12 hr. The reaction conditions for PKC II was 3%L PKC II, 1mM ATP or 6.5 nM (1.2 jiCi) (y- 
32 P)-ATP (6000Ci/mmol), 5 mM CaCh, 140 jaM phosphatidylserine and 3.8 \M diacylglycerol in buffer (20 mM Tris- 
HCl, pH 7.5, 10 mM MgCh) at 25°C for 30 min to 12 hr. 

101 30] For testing phosphatases, the PKA-phosphorylated chimeric reporter protein was 
concentrated using a YM-30 Microcon concentrator and an equal volume of TNS300 buffer was added. The protein 
was then purified by Wi-NTA as described above. The purified protein was dephosphorylated with 2.5 U protein 
phosphatase 1 (PP1; New England Biolabs) in the presence of 1 mM MnCk in PP1 buffer (50mM Tris-HCl, pH 7.0, 
0.1 mM Na 2 EDTA, 5 mM DTT, 0.01 % Brij35) at 25°C for 30 min to 3 hr. 

Cell Culture 

[0131] HeLa cells or COS-7 cells were plated onto sterilized glass coverslips in 2 cm dishes or in 
10 cm plates and grown to 50-90% confluency at 37 °C in low-glucose DMEM medium. Cells were transfected with 
the FuGEME-6 transfection reagent (Roche). Briefly, cells on 24-mm glass coverslips in a 2 cm dish were transfected 
with 0.5 ng DNA (purified using the Midiprep Kit from Qiagen), 100 \il of OptiMEMI medium and 3 pi of FuGENE-6. 
After a 30 min incubation at room temperature, the mixture was added to the cells. 

[0132J For imaging analysis, cells were incubated for 24-72 hr at 37 °C in fetal bovine serum in low 
glucose DMEM, then washed twice with HBSS buffer (20 mM Hepes, pH 7.4, 2g/L D-glucose), maintained in buffer in 
the dark at room temperature, with addition of forskolin (FsK; Calbiochem) and N6, 2'-0-dibutyryl cyclic adenosine 
3\5' monophosphate (dbcAMP; Calbiochem). FsK was dissolved in DMSO to be used as a 50 mM stock solution. To 
add a DMSO solution, 500 \iL of HBSS buffer was taken from the imaging dish and mixed with the DMSO solution, 

-31- 



I 

WO 02/095058 PCT/US02/16955 

then added back to the imaging dish. For inhibition studies of PKA activity, cells were preincubated in DMEM medium 
containing 10 jiM H-89 for approximately 2hr. 
Imaging Studies 

10133] Cells were imaged on a Zeiss Axiovert microscope with a cooled CCD camera (Photometries; 
Tucson AZ), controlled by Metafluor 2.75 software (Universal Imaging; West Chester PA). Dual-emission ratio 
imaging used a 440DF30 excitation filter, a 455DRLP dichroic mirror and two emission filters (480DF30 for ECFP, 
535DF25 for citrine) altered by a filter changer (Lambda 1 0-2, Sutter Instruments, San Rafael, C A). Fluorescence 
images were background-corrected. Exposure time was 1000 ms and images were taken every 15sec. 

Results 

10134] The protein kinase, PKA, which is involved in cell signaling, was used to exemplify a 
ratiomeric indicator for serine/threonine phosphorylation. cAMP is one if the main second messengers and plays an 
important role as a signal transducer in many cellular activities, the signal being transmitted by activation of PKA. 
PKA regulates specific gene expression by phosphorylating specific nuclear proteins at serine/threonine residues 
within consensus sequences. The 14-3-3 family of proteins mediates signal transduction by binding to phosphoserine- 
containing proteins and, therefore, can contain a consensus sequence that could encompass consensus site for PKA. 
Accordingly, fusion proteins were constructed in which a GFP FRET pair were linked by a 14-3-3 protein and a 
substrate peptide. Upon phosphorylation, binding between 14-3-3 and the substrate can bring two GFPs into 
proximity such that FRET is enhanced as the readout for phosphorylation. For the GFP FRET pair, ECFP and citrine, 
which is an improved yellow fluorescence protein with a better stability to acid conditions, were selected. The C- 
terminal tail of 14-3-3 was omitted, and replaced with a flexible linker AGGTGGS (SEO ID N0:19). Another flexible 
linker GGTGGSEL (SEQ ID N0:21 ) was inserted between the substrate peptide and citrine. 

10135] A number of chimeras containing different substrate sequences, including the kemptide 
(LRRASLG; SEQ ID N0:32), a modified kemptide (LRRASLP; SEQ ID N0:20) and a peptide sequence derived from Raf- 
259 (AQRSTSTPN; SEQ ID I\I0:33), were generated. The C4kY2.1 reporter (Figure 6B) gave the best response to the 
activation of PKA. C4kY1.1 containing the kemptide sequence also was responsive to the activation of PKA. While 
the 14-3-3 recognized a consensus sequence RSXSXP (SEQ ID NQ:34), it was rationalized that mutagenizing the 
glycine at the +2 position with respect to the serine, to a proline, could increase the binding affinity. As such, the 
C4UY2.1 chimeric reporter protein was constructed containing a modified kemptide sequence. 

In vitro Characterization of a Chimeric Reporter Protein 

[0136] The chimeric reporter protein C4kY2.1 was efficiently phosphorylated by the catalytic subunit 
of PKA, while the mutant C4kY2.1(S475A) protein was not phosphorylated in the in vitro phosphorylation assay. This 
result indicates that phosphorylation occurs at the designated serine in the modified kemptide sequence 
(LRRAS475LP; SEQ ID N0:20) containing the consensus PKA phosphorylation site (RRXS; SEQ ID N0:35). 
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10137] The effect of a structural alteration due to phosphorylation on the efficiency of FRET also 
was examined. Excitation of the fluorophore in the ECFP at 434 nm results in fluorescence emission at 526 nm from 
the citrine due to FRET between the two fluorophores. When the C4kY2.1 chimeric reporter protein was 
phosphorylated, FRET was increased and the ratio of the emission at 526 nm and 476 nm increased from a value of 
1.09 to 1.42 in a time-dependent manner. Negative controls, in which either PKA or ATP were omitted, demonstrated 
that FRET change is dependent on the PKA phosphorylation. The mutant C4kY2.1 (S475A), in which the designated 
serine was mutated to an alanine did not give any change of FRET upon incubation with PKA catalytic subunit and 
1 mM ATP. This result demonstrates that the FRET change is due to a conformational change, for example, due to 
binding between the phosphorylated peptide and 14-3-3, which is induced by the PKA phosphorylation at the serine 
475. 

[0138] Following treatment of the C4kY2.1 chimera with 10 fig of trypsin for 40 min (Miyawaki and 
Tsien, Meth. Enzymol. 37:472-500, 2000, which is incorporated herein by reference), FRET efficiencies of 
phosphorylated and nonphosphorylated C4kY2.1 were calculated to be 20 and 30%, respectively. Furthermore, the 
PKA/ATP-dependent FRET change was reversible upon treatment with protein phosphatase 1 (PP1), which is specific 
for phosphorylated serine/threonine. When the purified PKA-phosphorylated chimera was incubated with PP1, FRET 
was decreased and the emission ratio of 526 nm and 476 nm decreased from a value of 1 .39 to 1 .14 in a time- 
dependent manner. 

[0139] The specificity of the chimeric PKA reporter protein C4kY2.1 was examined by applying 
CaMKII and PKC II to the chimera in vitro. These two kinases have the consensus sequences RXXS/T (SEQ ID N0:36) 
and K/RXXS/T (SEQ ID N0:37), respectively, which overlap with the kemptide sequence. The chimera was poorly 
phosphorylated by both kinases, and incubation of the chimera with the either kinase did not result in a significant 
FRET change. These results demonstrate that the C4kY2.1 chimera is specific to PKA phosphorylation//; vitro. 

Expression and Behavior of PKA Reporter in Mammalian Cells 

[0140] When the chimeric PKA reporter C4kY2.1 was transfected into HeLa cells, the fluorescence 
was uniformly distributed in the cytosolic compartment, but was excluded from the nucleus, as expected for such a 
chimeric protein that lacked targeting signals. A similar expression pattern also was observed in COS-7 cells. To 
elevate the activities of PKA, transfected cells were treated with cAMP-elevating agents. Exposure of the cells to 50 
\M FsK and 1 mM dibutyryl cAMP increased the 535 nm/480 nm ratio in a time-dependent manner. Typically the 
ratio reached a plateau at 5 min after stimulation. When C4kY2.1(S475A) transfected cells were treated in the same 
condition, no change in the 535 nm/480 nm ratio was observed. This result is consistent with the intramolecular 
binding between 14-3-3 and the peptide phosphorylated by PKA at the serine475 being responsible for the FRET 
change observed. 

[D141] Either FsK or dbcAMP, alone, induced the FRET change. In general, the change in the 
emission ratio 535 nm/480 nm was from 25% to 34%. In addition, the change and kinetics were dependent on the 
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fluorescence intensity of the transfected cells; i.e., the brighter the cell, the bigger the FRET change and the more 
slowly the plateau was reached. Administration of 1 mM dbcAMP, alone, slowly increased the 535 nm/480 nm ratio, 
while pretreatment with 1 0 \M H-89, a PKA inhibitor, for 2 hr slowed down the process even more, but did not 
completely inhibited it. The change in the emission ratio 535 nm/480 nm in the cells pretreated with H-89 was 46%, 
suggesting that the pretreatment with the PKA inhibitor reduced the basal level of PKA phosphorylation inside cells 
prior to the stimulation, giving rise to a greater FRET change. A similar increase in the 535 nm/480 nm ratio was 
observed in COS-7 cells treated with FsK and with dbcAMP, although the change was smaller (about 5%) and a higher 
starting ratio was observed. 

EXAMPLE 2 

Preparation and Characterization of Chimeric Reporter 
Proteins for Detecting Tyrosine Kinase Activity 

[0142] This example provides methods for preparing a chimeric src reporter protein and a chimeric 
EGFR reporter protein, and demonstrates that such chimeric reporter proteins can detect tyrosine kinase activity. 

Preparation of the Chimeric Gene Encoding the Chimeric E6FR Reporter Protein 

[0143] The SH2 domain from mouse p52 She (see Lanning and Lafuse, Immunogenetics 49:498-504, 
1999; GenBank Accession No. AF054823, each of which is incorporated herein by reference) was amplified by PCR 
using the following primers: 

EGFR.Fwd • (5'-GCCGCCCGCATGCATTGGTTCCACGGGAAGCTGAGC CGG-3'; SEQ ID NQ:38), and 

EGFRoptsub.Rev - (5'-TACCATGAGCTCTGATTGCGGAGCCATGTTCATG 
TACTCAGCTTCCTCTTCAGGCTTCCCAGATCCAGAGTGAGACCCCACGGGTTGCTCTAGGCACAG-3'; SEQ ID N0:39). 

[0144] The PCR reaction was assembled as: 67 pi of water, 1 0 jxL of 10X Taq buffer (Promega), 
16 \xl of 25 mM MgCb, 2 pi of a 25 mM stock of dNTPs, 1.5 |xL of 100 pM EGFR.Fwd primer, 1.5 \il of 100 p.M 
EGFRoptsub.Rev primer, 1 pi of 0.1 p.g/pl p52 She template, and 1 pi of Taq polymerase (Promega). The following 
cycle was used: 94 °C, 2 min; 2 cycles of (95 °C, 2 min; 40 °C, 2 min; 72 °C, 2:30 min); 28 cycles of (95 °C, 1 min; 
52 °C, 1 min; 72 °C, 2:30 min); 72 °C, 10 min; 25 °C hold. 

[0145] The PCR product was digested with Sac I and Sph I and ligated into similarly digested YC3.3 
plasmid, which contained the genes for EYFP and ECFP behind an N-terminal Hiss tag, and is derived from the vector 
pRSETB. This construct, called "Eopt-pRSETB", was used for bacterial expression of the EGFR reporter. The 
structure of the chimeric EGFR reporter protein is illustrated in Figure 7A. 

[0146] For mammalian expression, the Eopt-pRSETB plasmid was cloned into the vector pCDI\IA3 
using the Bam HI and Eco Rl restriction sites. Constructs prepared in this way retained their N-terminal Hiss tags. 
The mammalian expression plasmid was named "Eopt-pCDNAS". 



-34- 



WO 02/095058 



PCT7US02/16955 



Preparation of the Chimeric Gene Encoding the Chimeric Src Reporter Protein 

[0147] The SH2 domain from v-Src was amplified by PCR using the following primers: 
SrcSH2.Fwd - (5'-GCCGCTCGCATGCATTGGTATTTTGGGAAGATCAC-3'; SEQ ID N0:40); and 
SrcSH2.Rev - (5'-CACCATGAGCTCAAATTCACCGTAGATCTCAGAACCCT 
CACCAGAACCCGGCTTCCCAGATCCAGATGTAGACCCACAGACGTTAGTCAGGCG-3'; SEQ ID N0:41). 

[0148] The PCR reaction was assembled as 67 pi of water, 1 0 jliL of 1 0X Taq buffer (Promega), 
16 pi of 25 mWl MgCk, 2 pL of a 25 mM stock of dNTPs, 1.5 jliL of 100 pM SrcSH2.Fwd primer, 1.5 \il of 100 pM 
SrcSH2.Rev primer, 1 pL of 0.1 jug/fiL v-Src template, and 1 pi of Taq polymerase (Promega). The following cycle 
was used: 94 °C, 2 min; 2 cycles of (95 °C, 2 min; 40 °C, 2 min; 72 °C, 2:30 min); 28 cycles of (95 °C, 1 min; 52 °C, 
1 min; 72 °C, 2:30 min); 72 °C, 10 min; 25 °C hold. 

10149] The PCR product was digested with Sac I and Sph I and ligated into YC3.3 plasmid digested 
with the same enzymes. This construct, called "SC-pRSETB", was used for bacterial expression of the Src reporter. 
The structure of the chimeric Src reporter protein is illustrated in Figure 7B. 

[01 50] For mammalian expression, the SC-pRSETB plasmid was cloned into the vector pCDNA3 using 
the Bam HI and Eco Rl restriction sites. Constructs prepared in this way retained their N-terminal His6 tags. The 
mammalian expression plasmid was named "SC-pCDNA3". 

Preparation of the Chimeric EGFR Reporter Protein and Chimeric Src Reporter Protein from Bacterial Cells 

[0151] For bacterial expression, chemically-competent DH5a bacteria were transformed with the 
Eopt-pRSETB plasmid. The cells were plated on LB agar supplemented with ampicillin (50 pg/ml. A single bacterial 
colony was used to inoculate an overnight 1 ml culture of LB supplemented with ampicillin (50 jig/ml), then the 1 ml 
culture was used inoculate a 500 ml culture of LB supplemented with ampicillin (50 pg/ml). The cells were grown at 
37 °C until ODeoo of about 0.4, then protein expression was induced with the addition of 500 pi of 1 M IPTG. The 
cells were incubated with shaking at 30 °C for 6 to 12 hr more, then harvested by centrifugation in GS-3 tubes at 
4°Cfor 10 min at 4000 rpm. 

[01521 The cell pellet was resuspended in 1 0 ml of B-PERII Reagent (Pierce) and the mixture was 
incubated with gentle rocking at 25 °C for 10 min, then the cell debris was pelleted by centrifugation in SS-34 tubes 
at 4 °C for 1 5 min at 1 5,000 rpm. The supernatant was separated and combined with 1 ml of Ni-NTA agarose 
(Qiagen) and the suspension was incubated at 4 °C for 30 min with gentle rotation. The suspension then was 
transferred to a screening column and the beads were rinsed with 2 x 8 ml of 50 mM Tris pH 7.4/300 mWl NaCI/10 
mM imidazole. The chimeric protein was eluted with 5 ml of 50 mM Tris pH 7.4/300 mM NaCI/100 mM imidazole, 
then dialyzed overnight in 50 mM Tris pH 7.4/50 mM NaCI/10 mM MgCb and stored in small aliquots at -20 °C. The 
typical yield was 5 ml of 1-5 pM pure EGFR reporter protein. 
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[01 53] The same procedure was used to prepare the chimeric Src reporter protein from bacteria! 

cells. 

Expression of the Chimeric EGFR Reporter Protein and Chimeric Src Reporter Protein in Mammalian Cells 

101 54] Eopt-pCDNA3 plasmid was amplified and purified using the endotoxin-free midiprep kit from 
Qiagen. For fluorescence measurements in living cells, mouse B82 cells, HeLa cells, or MIH3T3 cells in 2 cm dishes at 
50-90% confluence were transfected with 1 fxg of the Eopt-pCDNA3 plasmid using Fugene (Roche) according to 
standard protocols. The cells were incubated with the DIMA for 10-24 h at 37 °C in 5% CO2 and 1 0% calf (B82 cells 
and NIH3T3 cells) or fetal bovine serum (HeLa cells) in high glucose DMEM. The cells were then serum-starved in 
0.5% calf serum (in high glucose DMEM) for 6 to 24 hr. 

(01 55] For biochemical analysis of the reporter expressed in mammalian cells, mouse B82 cells, HeLa 
cells, or MIH3T3 cells in 10 cm dishes at 50-90% confluence were transfected with 1 of the Eopt-pCDI\IA3 plasmid 
using Effectene (Qiagen) according to standard protocols. The cells were incubated with the DMA for 10 to 24 hr at 
37 °C in 5% CO2 and 10% calf (B82 cells and NIH3T3 cells) or fetal bovine serum (HeLa cells) in high glucose DMEM. 
The cells were then serum-starved in 0.5% calf serum (in high glucose DMEM) for 6 to 24 hr. 

[0156] Expression of the chimeric EGFR reporter protein was assessed by fluorescence in vitro. The 
assay was performed at room temperature in 50 mM Tris, pH 7.4, 50 mM NaCI, 10 mM MgCk, 100 pM ATP and 
EGFR enzyme (Sigma). CFP was excited at 400 nM, and CFP and YFP emissions were measured at 75 nM and 
525 nm, respectively. A cutoff filter at 420 nm was used. The YFPrCFP ratio increased only when EGFR and ATP 
were included in the reaction mixture. 

[0157] The identical procedure was used to express the chimeric Src reporter protein in mammalian 
cells, except that the in vitro response was performed using 50 mM Tris, pH 7.4, 50 mM NaCI, 10 mM MgCb, 
100 \M ATP and c-Src enzyme (Upstate Biotechnology). CFP was excited at 432 nM, and CFP and YFP emissions 
were measured at 475 nm and 525 nm, respectively. 

Fluorescence Measurements of the Chimeric EGFR and Src Reporter Proteins in Mammalian Cells 

[0158] Live-cell imaging was performed as described for the cameleon Ca+ 2 indicators (Miyawaki et 
aL, Nature 388:882-887, 1997, which is incorporated herein by reference). EGFR was stimulated in HeLa cells and in 
mouse L cells by addition of 50 ng/ml of EGF (Sigma). For measuring the effect of inhibitor on the EGFR reporter, 100 
nM of AG1478 (Calbiochem) was incubated with the cells in HBSS (20 mM Hepes, pH 7.4, 2 g/L D-glucose) for 30 min 
at room temperature prior to application of EGF. 

[01 59] Addition of EGF to the transfected mouse L cells resulted in an immediate increase in the 
emission ratio, and reached a plateau at about a 27% emission ratio change by about 2 minutes. Following wash-out 
of the EGF from the culture, the emission ratio decreased in a time-dependent manner, and the decrease was 
accelerated by the addition of the AG1478 EGFR inhibitor. No response was observed when cells were incubated 
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with the EGFR inhibitor prior to addition of EGF, whereas the Src kinase inhibitor, PP1, had no effect on the increase 
in emission ratio due to EGF. 

[01 60] Src activity was stimulated in HeLa cells and in N1H3T3 cells with EGF (50 ng/ml) and PDGF 
(50 ng/ml), respectively. To inhibit Src activity, PP1 (A.G. Scientific) was incubated with the cells at 100 nM for 30 
min at room temperature. FRET was measured in the HeLa cells, and a 25% FRET decrease was measured in 
response to Src activation. The HeLa cells demonstrated membrane ruffling, characteristic of EGF-stimulated cells. A 
similar FRET decrease occurred in the NIH3T3 cells following administration of PDGF. In both HeLa cells and WIH3T3 
cells, the Src inhibitor, PP1, inhibited the FRET response, whereas the EGFR inhibitor, AG1478, had no significant 
effect. 

Biochemical Analysis of the Chimeric EGFR or Src Reporter Proteins Expressed in Mammalian Cells 

[0161] Reporters expressed in mammalian cells were analyzed by immunoprecipitation (IP), western 
blot analysis, or fluorescence of the purified protein. For IP and western blot analysis, the following procedure was 
used. Following exposure to the appropriate stimulant or chemical (EGF, PDGF, or inhibitor), transfected cells in 10 
cm culture dishes were rinsed twice with ice-cold HBSS (20 mM Hepes, pH 7.4, 2 g/L D-glucose) and lysed with 1.5 
ml lysis buffer (50 mM Tris, pH 7.4, 100 mM NaCI, 1 mM EDTA, 0.1% Triton X-100, 10 mM NaF, 2 mM sodium 
orthovanadate, 1 mM phenylmethylsuifonyl fluoride, 10 jig/ml pepstatin, 10 ng/ml feupeptin, and 10 jag/ml aprotinin). 
The suspensions were cleared by centrifugation, and the reporters were immunoprecipitated from the supernatant 
using a rabbit polyclonal anti-GFP antibody immobilized on protein G-Sepharose resin (Amersham). Western blot 
analysis was performed on samples run in 12% PAGE-SDS gels using a mouse monoclonal anti-phosphotyrosine 
antibody (Upstate Biotechnology), a rabbit monoclonal anti-EGFR antibody (Calbiochem), or a mouse monoclonal 
anti-GFP antibody (Clontech) with secondary detection by anti-rabbit HRP or anti-mouse HRP. 

[0162] For fluorescence analysis of purified reporters, the crude cell lysates were purified over a 
column of Mi-NTA agarose (Qiagen). The His6-tagged reporters were eluted with 50 mM Tris pH 7.4/300 mM 
NaCI/100 mM imidazole and dialyzed into 50 mM Tris pH 7.4/50 mM NaCI/10 mM MgCk. 

EXAMPLE 3 

Identification of Chimeric Phosphorylation Indicators 
Using High Throughput Screening 

[0163] Based on the results described Examples 1 and 2, additional chimeric phosphorylation 
indicators can be developed and optimized, using a variety of fluorescent proteins, luminescent molecules, kinase or 
phosphatase substrates, PAABDs, and tetracysteine motifs. High throughput strategies are particularly useful for 
systematically generating and testing diverse libraries of such constructs (Zhao and Arnold, Cum Opm, Chem. Biol 
3:284-290, 1997, which is incorporated herein by reference). 
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10 1 64] Initial high throughput diversity generation and screening are performed using the exemplified 
chimeric phosphorylation indicators (Examples 1 and 2), which are active in mammalian cells. Iterative cycles of 
variegation, for example (see U.S. Pat. Wo. 5,837,500, which is incorporated herein by reference), can confer 
additional useful features on these chimeric reporters. Diversity also can be created by a variety of other methods, 
including, for example, error prone PCR, oligonucleotide-directed mutagenesis with various degrees of bias towards 
wild-type codons (Cormack eta/., Gene 173:33-38, 1996, which is incorporated herein by reference), DNA shuffling 
(Crameri etal., Nat BiotechnoL, 14:315-319, 1996; Minshull and Stemmer, Curr. Opin. Chew. Biol, 3:284-290, 
1999, each of which is incorporated herein by reference), incorporation of random spacers, or "incremental truncation 
for the creation of hybrid enzymes" (Ostermsier et aL, Nat BiotechnoL, 17:1205-1209, 1999, which is incorporated 
herein by reference). 

(0165] High throughput screening can utilize a method such as fluorescence activated cell sorting 
(FACS), which can allow the identification of mutants or clones with the greatest brightness, highest expression levels 
under particular growth conditions, or unusual emissions spectra. FACS can be particularly useful for screening FRET 
constructs, in which the donor and acceptor are part of the same chimera, so that ratios of the two emissions reflect 
only FRET rather than variable expression levels. Chimeras having the most desirable characteristics then can be 
examined further to identify those that respond before and after application of a stimulus such as induction of kinase 
activity in a correlated way. 

[0 1 66] Although the chimeric phosphoryfatable indicators can be screened in bacteria, yeast, or 
mammalian cells, the latter are preferred because, ultimately, the indicators are most useful for studies of mammalian 
cells. Standard cultured lines such as HeLa, CHO, HEK-293, 3T3, or Jurkat are the cells of choice for the initial 
screening studies, although other cells, including cardiomyocytes, B cells, and the like can be used for confirmatory 
studies. For studies in mammalian cell lines, it is preferred that any individual cell pick up only a single chimeric 
sequence because the presence of fluorescent but nonresponsive proteins would fatally dilute and obscure the 
response of the good construct. Transfection methods such as electroporation or lipofection allow many different 
DNA molecules to enter each permeabilized cell and, therefore, are suitable only for screens in which nonfunctional 
sequences do not interfere. In comparison, a method such as retroviral infection can allow for insertion of a single 
chimera construct into the viral package. By working with a multiplicity of infection much less than 1, nearly all the 
transfectants will contain only a single introduced sequence. Cells that do not get infected will be nonfluorescent and 
noninterfering. 

[01 67] FACS can be used to discard nonfluorescent cells, and to deposit single fluorescent individuals 
into separate wells of microtiter plates. The cells then can be expanded so as to obtain a sufficient number of cells in 
each well, probably with ongoing antibiotic selection. The clones then can be read using a microplate fluorometer, 
before and after appropriately stimulating the cells, for example by pharmacologically activating an endogenous or 
coexpressed kinase or phosphatase. Alternatively, the cells can be lysed and the fluorescence read before and after 
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adding a separately expressed and purified enzyme. DNA recovered from the wells showing the best response then 
can be sequenced and re-expressed. 

[0168] A microscopy-based method also can be used. For example, a relatively large numbers of cells 
can be plated into each well of several microtiter plates and, after the cells have attached and physiologically 
equilibrated to the culture, a reagent can be added to stimulate the cells, which are then be re-imaged and compared 
to the prestimulus views. Appropriate software can highlight those cells having the greatest response to the stimulus, 
and a DNA sample can be obtained from the cells using a micropipet. The advantages of this strategy are that that it 
avoids the low survival-rate and long duration associated with growing clones from single isolated cells, and that 
much more diverse libraries can be examined, with hundreds to thousands of cells per well. 

[01 69] At least two versions of planar microfluidic cell sorting have been proposed, one using cells 
pumped single file by electroosmotically driven flow through a T junction molded into silicone elastomer (Fu et a/., Nat 
BiotechnoL, 17:1 1 09-1 111, 1999, which is incorporated herein by reference), and the other using arrays of cells 
individually releasable by microbubble formation. These methods work with nonadherent cells and, in principle, can be 
used to recover live cells, not just DNA samples. 

[0170] Although the invention has been described with reference to the above examples, it will be 
understood that modifications and variations are encompassed within the spirit and scope of the invention. 
Accordingly, the invention is limited only by the following claims. 
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What is claimed is: 

1 . A chimeric phosphorylation indicator, comprising, in operative linkage, a donor molecule, a 
phosphorylatable domain, a phosphoaminoacid binding domain, and an acceptor molecule, wherein the 
phosphoaminoacid binding domain specifically binds to a phosphoaminoacid when present in the phosphorylatable 
domain, wherein the donor molecule and the acceptor molecule exhibit a detectable resonance energy transfer when 
the donor is excited, and wherein the phosphorylatable domain and phosphoaminoacid binding domain do not 
substantially emit light to excite the acceptor. 

2. The chimeric phosphorylation indicator of claim 1, wherein at least one of the donor molecule or 
the acceptor molecule is a fluorescent protein. 

3. The chimeric phosphorylation indicator of claim 2, wherein the fluorescent protein is a green 
fluorescent protein (GFP), a red fluorescent protein (RFP), or a fluorescent protein related to a GFP or an RFP. 

4. The chimeric phosphorylation indicator of claim 3, wherein the red fluorescent protein is a 
Discosoma RFP or a fluorescent protein related to a Discosoma RFP. 

5. The chimeric phosphorylation indicator of claim 4, wherein the Discosoma RFP is DsRed or a 
mutant thereof. 

6. The chimeric phosphorylation indicator of claim 3, wherein the fluorescent protein is a GFP 
selected from an Aequorea GFP, a Renilla GFP, a Phialidium GFP, or a fluorescent protein related to an Aequorea GFP, 
a Renilla GFP, and a Phialidium GFP. 

7. The chimeric phosphorylation indicator of claim 6, wherein the fluorescent protein related to the 
Aequorea GFP is a cyan fluorescent protein (CFP), or a yellow fluorescent protein (YFP), or a spectral variant of the 
CFP or YFP. 

8. The chimeric phosphorylation indicator of claim 1, wherein the donor molecule is a fluorescent 
protein, and the detectable resonance energy transfer is fluorescent resonance energy transfer. 

9. The chimeric phosphorylation indicator of claim 1 , wherein at least one of the donor molecule or 
the acceptor molecule is a luminescent molecule. 

1 0. The chimeric phosphorylation indicator of claim 9, wherein the luminescent molecule further 
comprises a cell compartmentalization domain. 

11. The chimeric phosphorylation indicator of claim 1 0, wherein the cell compartmentalization domain 
is a membrane translocating domain. 

1 2. The chimeric phosphorylation indicator of claim 1 , wherein the donor molecule is a luminescent 
molecule, and the detectable resonance energy transfer is luminescent resonance energy transfer. 

1 3. The chimeric phosphorylation indicator of claim 1 2, wherein the acceptor molecule is a fluorescent 

protein. 
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14. The chimeric phosphorylation indicator of any of claims 1-13, wherein the phosphorylatable 
domain comprises a serine/threonine kinase phosphorylatable domain or a tyrosine kinase phosphorylatable domain. 

1 5. The chimeric phosphorylation indicator of claim 1 , wherein at least one amino acid of the 
phosphorylatable domain is phosphorylated. 

1 6. The chimeric phosphorylation indicator of claim 1 5, wherein the amino acid is serine, threonine, 
tyrosine, or a combination thereof. 

17. A chimeric phosphorylation indicator, comprising a phosphorylatable polypeptide and a fluorescent 

protein. 

1 8. The chimeric phosphorylation indicator of claim 1 7, wherein the phosphorylatable polypeptide 
comprises an l\l-terminal portion and a C-terminal portion, and wherein the fluorescent protein is operatively inserted 
between the N-terminal portion and C-terminal portion of the phosphorylatable polypeptide. 

1 9. The chimeric phosphorylation indicator of claim 1 8, wherein the fluorescent protein is a circularly 
permuted fluorescent protein. 

20. The chimeric phosphorylation indicator of claim 1 7, further comprising a phosphoaminoacid binding 
domain operatively linked to the phosphorylatable polypeptide, wherein the fluorescent protein comprises an N- 
terminal portion and a C-terminal portion, and wherein the phosphorylatable polypeptide and operatively linked 
phosphoaminoacid binding domain is operatively inserted between the N-terminal portion and C-terminal portion of the 
fluorescent protein. 

21 . The chimeric phosphorylation indicator of any of claims 17-20, wherein at least one amino acid of 
the phosphorylatable polypeptide is phosphorylated. 

22. A kit, comprising at least one of a chimeric phosphorylation indicator of claim 1, a chimeric 
phosphorylation indicator of claim 17, a polynucleotide encoding a chimeric phosphorylation indicator of claim 1, or a 
polynucleotide endoding a chimeric phosphorylation indicator of claim 17. 

23. A polynucleotide encoding the chimeric phosphorylation indicator of claim 1 or claim 1 7. 

24. A vector comprising the polynucleotide of claim 23. 

25. The vector of claim 24, which is an expression vector. 

26. A host cell containing the polynucleotide of claim 23. 

27. A method for detecting a kinase or phosphatase in a sample, the method comprising: 
contacting the sample with a chimeric phosphorylatable indicator of claim 1, 
exciting the donor molecule; and 

determining a fluorescence or luminescence property in the sample, 
wherein the presence of a kinase or phosphatase in the sample results in a change in the degree of fluorescence 
resonance energy transfer (FRET) or luminescence resonance energy transfer (LRET), thereby detecting the kinase or 
phosphatase in the sample. 
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28. The method of claim 27, wherein the change in the degree of FRET or LRET is an increased or 
decreased amount of FRET or LRET. 

29. The method of claim 27, wherein a change in the degree of FRET or LRET is indicative of a kinase 
or phosphatase in the sample. 

30. The method of claim 27, wherein the phosphorylatable domain is phosphorylated prior to 
contacting the sample with a chimeric phosphorylatable indicator. 

31 . The method of any of claims 27-30, wherein the sample is a biological sample. 

32. The method of claim 27, wherein the chimeric phosphorylatable indicator further comprises a 
targeting sequence wherein the targeting sequence targets the cytosol, endoplasmic reticulum, mitochondrial matrix, 
chloroplast lumen, medial trans-Golgi cisternae, a lumen of a lysosome, or a lumen of an endosome. 

33. The method of claim 27, wherein the chimeric phosphorylatable indicator further comprises a cell 
compartmentalization domain that is a membrane translocating domain. 

34. A method for detecting a kinase or phosphatase in a sample, the method comprising: 
contacting the sample with a chimeric phosphorylatable indicator of claim 17,, 
determining a fluorescence property in the sample, 

wherein the presence of kinase or phosphatase activity in the sample results in a change in the fluorescence property 
as compared to the fluorescent property in the absence of a kinase or phosphatase activity, thereby detecting the 
kinase or phosphatase in the sample. 

35. The method of claim 34, wherein the absence of kinase or phosphatase activity in the sample is 
due to the presence of a kinase inhibitor or phosphatase inhibitor. 

36. A method for detecting a kinase inhibitor or phosphatase inhibitor, the method comprising: 
determining a first fluorescence property of a chimeric phosphorylatable indicator of claim 1 in the 
presence of a kinase or a phosphatase, 

contacting the chimeric phosphorylatable indicator with a composition suspected of being a kinase 
inhibitor or a phosphatase inhibitor, 

determining a second fluorescence property of a chimeric phosphorylatable indicator in the 
presence of the composition, 

wherein a difference in the first fluorescence property and second fluorescence property identifies the composition as 
a kinase inhibitor or phosphatase inhibitor. 

37. The method of claim 36, which is adapted to high throughput analysis. 
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SEQUENCE LISTING 

<110> REGENTS OF THE UNIVERSITY OF CALIFORNIA 
TSIEN, Roger 
TING, Alice 
ZHANG, Jin 

<120> EMISSION RATIOMETRIC INDICATORS OF PHOSPHORYLATION 
<130> UC089 . 1CPC1QPC 
<160> 42 

<170> Patentln version 3.0 

<210> 1 

<211> 716 

<212> DNA 

<213> Aequorea victoria 
<220> 

<221> CDS 

<222> (1) . . (714) 

<400> 1 

atg agt aaa gga gaa gaa ctt ttc act gga gtt gtc cca att ctt gtt 48 
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 
1 5 10 15 

gaa tta gat ggt gat gtt aat ggg cac aaa ttt tct gtc agt gga gag 96 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

ggt gaa ggt gat gca aca tac gga aaa ctt acc ctt aaa ttt att tgc 144 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
35 40 45 

act act gga aaa eta cct gtt cca tgg cca aca ctt gtc act act ttc 192 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

tct tat ggt gtt caa tgc ttt tea aga tac cca gat cat atg aaa cag 240 
Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 

cat gac ttt ttc aag agt gec atg ccc gaa ggt tat gta cag gaa aga 2 88 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

act ata ttt ttc aaa gat gac ggg aac tac aag aca cgt get gaa gtc 336 
Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt att 384 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 125 
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gat ttt aaa gaa gat gga aac att ctt gga cac aaa ttg gaa tac aac 432 
Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat gga 480 
Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

ate aaa gtt aac ttc aaa att aga cac aac att gaa gat gga age gtt 528 
He Lys Val Asn Phe Lys He Arg His Asn lie Glu Asp Gly Ser Val 
165 170 175 

caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc cct 576 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 

gtc ctt tta cca gac aac cat tac ctg tec aca caa tct gee ctt teg 624 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt gta 672 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

aca get get ggg att aca cat ggc atg gat gaa eta tac aaa ta 716 
Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 2 
<211> 238 
<212> PRT 

<213> Aequorea victoria 
<400> 2 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 
15 10 15 



Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 



Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
35 40 45 



Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 



Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 



His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
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85 90 95 



Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 " 105 110 



Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 125 



Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 



Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 



He Lys Val Asn Phe Lys lie Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 



Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 
180 185 190 



Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 



Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leii Glu Phe Val 
210 215 220 



Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 

235 



225 


230 


<210> 


3 


<211> 


720 


<212> 


DNA 


<213> 


Aequorea victoria 


<220> 




<221> 


CDS 


<222> 


(1) . . (720) 


<400> 


3 



atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 
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gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lye Phe lie 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag taa 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 4 

<211> 239 

<212> PRT 

<213> Aequorea victoria 
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<400> 4 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 



Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 



Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 



lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 



Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 



Gly lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 



Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 



Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 
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Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 



225 



230 



235 



<210> 5 

<211> 720 

<212> DNA 

<213> Aequorea victoria 
<220> 

<221> CDS 

<222> (1)..(720) 

<400> 5 

at 9 9 fc 9 a 9 c aa 9 99 c 9 a 9 9 a 9 ct 9 ttc acc 999 9 fc 9 9^9 ccc atc ct 9 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac agg ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Arg Phe Ser Val Ser Gly 
20 25 30 

9 a 9 99c gag ggc gat gec acc tac ggc aag ctg acc ctg aag ttc atc 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg acc tgg ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 
Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gec atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgt acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gee gag 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac atc age cac aac gtc tat atc acc gee gac aag cag aag aac 480 
Asn Tyr He Ser His Asn Val Tyr He Thr Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc atc aag gec cac ttc aag atc cgc cac aac atc gag gac ggc age 528 
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Gly lie Lys Ala His Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

gtg cag etc gec gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gee ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gec gec ggg ate act etc ggc atg gac gag ctg tac aag taa 720 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 6 

<211> 239 

<212> PRT 

<213> Aequorea victoria 

<400> 6 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 



Val Glu Leu Asp Gly Asp Val Asn Gly His Arg Phe Ser Val Ser Gly 
20 25 30 



Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 



Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 
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He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 



Asn Tyr He Ser His Asn Val Tyr He Thr Ala Asp Lys Gin Lys Asn 
145 150 155 160 



Gly He Lys Ala His Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 



Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 



Pro. Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 7 

<211> 720 

<212> DNA 

<213> Aequorea victoria 
<220> 

<221> CDS 

<222> (1) . . (720) 

<400> 7 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gec acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ttc ggc tac ggc gtg cag tgc ttc gee cgc tac ccc gac cac atg aag 240 
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Phe Gly Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 

65 70 75 80 

cag cac gac ttc ttc aag tec gec atg ccc gaa ggc tac gtc cag gag 288 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc ace ate ttc ttc aag gac gac ggc aac tac aag acc cgc gec gag 336 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gee gac aag cag aag aac 4 80 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 

145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 

Gly lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age tac cag tec gee ctg 624 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gee gec ggg ate act etc ggc atg gac gag ctg tac aag taa 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 8 
<211> 239 
<212> PRT 

<213> Aequorea victoria 
<400> 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 



Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 
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Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 



Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



Phe Gly Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 ' 105 110 



Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 



lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 



Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 " 155 160 



Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 



Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 



Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 9 
<211> 720 
<212> DNA 
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<213> Aequorea victoria 
<220> 

<221> CDS 

<222> (1) . . {720) 

<400> 9 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

99 c £f a 9 99 c 9 at gee acc tac ggc aag ctg acc ctg aag ttc ate 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ttc ggc tac ggc ctg aag tgc ttc gec cgc tac ccc gac cac atg aag 240 

Phe Gly Tyr Gly Leu Lys Cys Phe Ala Arg Tyr Pro Asp His Met Lys 

65 70 75 80 

cag cac gac ttc ttc aag tec gec atg ccc gaa ggc tac gtc cag gag 288 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gee gag 336 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gec gac aag cag aag aac 480 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 

145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gec gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age tac cag tec gee ctg 624 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
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195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gec gec ggg ate act etc ggc atg gac gag ctg tac aag taa 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 10 

<211> 239 

<212> PRT 

<213> Aequorea victoria 

<400> 10 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1-5 10 15 



Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 



Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
. 50 55 60 



Phe Gly Tyr Gly Leu Lys Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 



He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 



Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 
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Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 



Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 



Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 11 

<211> 859 

<212> DNA 

<213> Discosoma sp. 

<220> 

<221> CDS 

<222> (54) . . (731) 

<400> 11 

gtttcagcca gtgacggtca gtgacagggt gagccacttg gtataccaac aaa atg 56 

Met 
1 

agg tct tec aag aat gtt ate aag gag ttc atg agg ttt aag gtt cgc 104 
Arg Ser Ser Lys Asn Val He Lys Glu Phe Met Arg Phe Lys Val Arg 
5 10 15 

atg gaa gga acg gtc aat ggg cac gag ttt gaa ata gaa ggc gaa gga 152 
Met Glu Gly Thr Val Asn Gly His Glu Phe Glu He Glu Gly Glu Gly 
20 25 30 

gag ggg agg cca tac gaa ggc cac aat acc gta aag ctt aag gta acc 2 00 

Glu Gly Arg Pro Tyr Glu Gly His Asn Thr Val Lys Leu Lys Val Thr 
35 40 45 

aag ggg gga cct ttg cca ttt get tgg gat att ttg tea cca caa ttt 24 8 

Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp He Leu Ser Pro Gin Phe 
50 55 60 65 

cag tat gga age aag gta tat gtc aag cac cct gee gac ata cca gac 296 
Gin Tyr Gly Ser Lys Val Tyr Val Lys His Pro Ala Asp He Pro Asp 
70 75 80 

tat aaa aag ctg tea ttt cct gaa gga ttt aaa tgg gaa agg gtc atg 344 
Tyr Lys Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val Met 
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85 90 95 

aac ttt gaa gac ggt ggc gtc gtt act gta acc cag gat tec agt ttg 392 

Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gin Asp Ser Ser Leu 

100 105 110 



cag gat ggc tgt ttc ate tac aag gtc aag ttc att ggc gtg aac ttt 
Gin Asp Gly Cys Phe lie Tyr Lys Val Lys Phe He Gly Val Asn Phe 
115 120 125 



440 



cct tec gat gga cct gtt atg caa aag aag aca atg ggc tgg gaa gee 488 
Pro Ser Asp Gly Pro Val Met Gin Lys Lys Thr Met Gly Trp Glu Ala 
13 0 135 140 145 

age act gag cgt ttg tat cct cgt gat ggc gtg ttg aaa gga gag att 536 
Ser Thr Glu Arg Leu Tyr Pro Arg Asp Gly Val Leu Lys Gly Glu He 
150 155 160 

cat aag get ctg aag ctg aaa gac ggt ggt cat tac eta gtt gaa ttc 584 
His Lys Ala Leu Lys Leu Lys Asp Gly Gly His Tyr Leu Val Glu Phe 
165 170 175 

aaa agt att tac atg gca aag aag cct gtg cag eta cca ggg tac tac 632 
Lys Ser He Tyr Met Ala Lys Lys Pro Val Gin Leu Pro Gly Tyr Tyr 
180 185 190 

tat gtt gac tec aaa ctg gat ata aca age cac aac gaa gac tat aca 680 
Tyr Val Asp Ser Lys Leu Asp He Thr Ser His Asn Glu Asp Tyr Thr 
195 200 205 

ate gtt gag cag tat gaa aga acc gag gga cgc cac cat ctg ttc ctt 728 
He Val Glu Gin Tyr Glu Arg Thr Glu Gly Arg His His Leu Phe Leu 
210 215 220 225 

taa ggctgaactt ggctcagacg tgggtgagcg gtaatgacca caaaaggcag 781 

cgaagaaaaa ecatgategt tttttttagg ttggcagcct gaaategtag gaaatacatc 841 

agaaatgtta caaacagg 859 



<210> 12 

<211> 225 

<212> PRT 

<213> Discosoma sp. 

<400> 12 

Met Arg Ser Ser Lys Asn Val He Lys Glu Phe Met Arg Phe Lys Val 
15 10 15 



Arg Met Glu Gly Thr Val Asn Gly His Glu Phe Glu He Glu Gly Glu 
20 25 30 



Gly Glu Gly Arg Pro Tyr Glu Gly His Asn Thr Val Lys Leu Lys Val 
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35 40 45 



Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp lie Leu Ser Pro Gin 
50 55 60 



Phe Gin Tyr Gly Ser Lys Val Tyr Val Lys His Pro Ala Asp lie Pro 
65 70 75 80 



Asp Tyr Lys Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val 
85 90 95 



Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gin Asp Ser Ser 
100 105 110 



Leu Gin Asp Gly Cys Phe lie Tyr Lys Val Lys Phe lie Gly Val Asn 
115 120 125 



Phe Pro Ser Asp Gly Pro Val Met Gin Lys Lys Thr Met Gly Trp Glu 
130 " 135 140 



Ala Ser Thr Glu Arg Leu Tyr Pro Arg Asp Gly Val Leu Lys Gly Glu 
145 150 155 160 



lie His Lys Ala Leu Lys Leu Lys Asp Gly Gly His Tyr Leu Val Glu 
165 170 175 



Phe Lys Ser lie Tyr Met Ala Lys Lys Pro Val Gin Leu Pro Gly Tyr 
180 "* 185 190 



Tyr Tyr Val Asp Ser Lys Leu Asp lie Thr Ser His Asn Glu Asp Tyr 
195 200 205 



Thr He Val Glu Gin Tyr Glu Arg Thr Glu Gly Arg His His Leu Phe 
210 215 220 



Leu 
225 



<210> 13 

<211> 18 

<212> PRT 

<213> Artificial sequence 
<220> 
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<223> Synthetic peptide 
<400> 13 

Gly Ser Thr Ser Gly Ser Gly Lys Pro Gly Ser Gly Glu Gly Ser Thr 



1 


5 


Lys Gly 


<210> 


14 


<211> 


8 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic peptide 


<220> 




<221> 


VARIANT 


<222> 


(2) (2) 


<223> 


Phosphorylated peptide 



<400> 14 

Asp' Tyr lie He Pro Leu Pro Asp 
1 5 



<210> 


15 


<211> 


9 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic peptide 


<220> 




<221> 


VARIANT 


<222> 


(8) . . (8) 


<223> 


Phosphorylated peptide 



<400> 15 

His He He Glu Asn Pro Gin Tyr Phe 
1 5 



<210> 


16 


<211> 


8 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic peptide 


<220> 
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<221> VARIANT 
<222> (5).. (5) 
<223> Phosphorylated peptide 



<400> 16 

Ala Arg Ser His Ser Tyr Pro Ala 
1 5 



<210> 


17 


<211> 


6 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic peptide 


<220> 




<221> 


VARIANT 


<222> 


(3) ..(4) 


<223> 


Xaa is any Amino Acid 



<400> 17 

Cys Cys Xaa Xaa Cys Cys 
1 5 



<210> 


IB 


<211> 


15 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic peptide 


<400> 


18 



Cys Arg Gin lie Lys Trp Phe Asn Arg Arg Met Lys Trp Lys Lys 
15 10 J 15 



<210> 


19 


<211> 


7 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic peptide 


<400> 


19 



Ala Gly Gly Thr Gly Gly Ser 
1 5 

<210> 20 
<211> 7 
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<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic peptide 

<400> 20 

Leu Arg Arg Ala Ser Leu Pro 
1 5 

<210> 21 

<211> 8 

<212> PRT 

<213> Artificial sequence 
<220> 

<22 3> Synthetic peptide 

<400> 21 

Gly Gly Thr Gly Gly Ser Glu Leu 
1 5 

<210> 22 

<211> 9 

<212> PRT 

<213> Artificial sequence 
<220> 

<22 3> Synthetic peptide 

<400> 22 

Gly Ser His Ser Gly Ser Gly Lys Pro 
1 5 



<210> 


23 


<211> 


13 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic peptide 


<400> 


23 



Glu Glu Glu Ala Glu Tyr Met Asn Met Ala Pro Gin Ser 
15 10 

<210> 24 

<211> 15 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic peptide 
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<400> 24 

Gly Ser Thr Ser Gly Ser Gly Lys Pro Gly Ser Gly Glu Gly Ser 
15 10 15 

<210> 25 

<211> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic peptide 

<400> 25 

Glu lie Tyr Gly Glu Phe 
1 5 

<210> 26 

<211> 34 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR 

<400> 26 

gggcatgcat atggagaaga ctgagctgat ccag 34 



<210> 27 
<211> 92 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR 
<400> 27 

cgcggagctc gctgccgccg gtgccgccca ggctggcgcg acggaggctg ccgccggtgc 60 
cgcctgcaga gtctgatgtc caaagtgtta gg 92 



<210> 28 

<211> 33 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer for PCR 

<400> 28 

cgtcgcgcca gcctgccagg caccggcggc age 33 
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<210> 29 

<211> 33 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer for PCR 

<400> 29 

gctgccgccg gtggctggca ggctggcgcg acg 



<210> 30 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer for PCR 

<400> 30 

gcctccgtcg cgccgcactg ccaggcaccg gc 

<210> 31 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer for PCR 

<400> 31 

gccggtgcct ggcagtgcgg cgcgacggag gc 



<210> 32 

<211> 7 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic peptide 

<400> 32 

Leu Arg Arg Ala Ser Leu Gly 
1 5 

<210> 33 

<211> 9 

<212> PRT 

<213> Artificial sequence 
<220> 

<22 3> Synthetic peptide 
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<400> 33 

Ala Gin Arg Ser Thr Ser Thr Pro Asn 



1 


5 


<210> 


34 


<211> 


6 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic peptide 


<220> 




<221> 


VARIANT 


<222> 


(1)..(6) 


<223> 


Xaa is any Animo Acid 


<400> 


34 


Arg Ser Xaa Ser Xaa Pro 


1 


5 


<210> 


35 


<211> 


4 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic peptide 


<220> 




<221> 


VARIANT 


<222> 


(3) . . (3) 


<223> 


Xaa is any Amino Acid 



<400> 


35 


Arg Arg Xaa Ser 


1 




<210> 


36 


<211> 


4 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic peptide 


<220> 




<221> 


VARIANT 


<222> 


(2) .. (3) 


<223> 


Xaa is any Amino Acid 
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<220> 

<221> VARIANT 

<222> (4) . . (4) 

<223> Xaa is Serine or Threonine 



<400> 36 

Arg Xaa Xaa Xaa 
1 

<210> 37 

<211> 4 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic peptide 
<220> 

<221> VARIANT 

<222> (1) . . (1) 

<223> Xaa is Lysine or Arginine 



<220> 

<221> VARIANT 

<222> (2) . . (3) 

<22 3> Xaa is any Amino Acid 



<220> 

<221> VARIANT 

<222> (4) . . (4) 

<223> Xaa is Serine or Threonine 



<400> 37 

Xaa. Xaa Xaa Xaa 
1 

<210> 38 
<211> 39 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR 
<400> 38 

gccgcccgca tgcattggtt ccacgggaag ctgagccgg 



<210> 39 
<211> 99 
<212> DNA 
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<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR 
<400> 39 

taccatgagc tctgattgcg gagccatgtt catgtactca gcttcctctt caggcttccc 60 
agatccagag tgagacccca cgggttgctc taggcacag 99 



<210> 40 

<211> 35 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR 

<400> 40 

gccgctcgca tgcattggta ttttgggaag atcac 35 



<210> 41 
<211> 93 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR 
<400> 41 

caccatgagc tcaaattcac cgtagatctc agaaccctca ccagaacccg gcttcccaga 60 
tccagatgta gacccacaga cgttagtcag gcg 93 



<210> 42 

<211> 7 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic peptide 
<220> 

<221> VARIANT 

<222> (2) . . (2) 

<223> Xaa is Arginine or Lysine 



<220> 

<221> VARIANT 

<222> (3) . . (3) 

<22 3> Xaa is Phenylalanine, Arginine, Serine or Asparagine 
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<220> 

<221> VARIANT 

<222> (4) . . (4) 

<223> Xaa is Arginine, Histidine, Lysine 



<220> 

<221> VARIANT 

<222> (5).. (5) 

<223> Phosphorylated peptide 



<220> 

<221> VARIANT 

<222> (6) . . (6) 

<223> Xaa is Tryptophan, Tyrosine, Phenylalanine, Leucine 



<400> 42 

Arg Xaa Xaa Xaa Ser Xaa Pro 
1 5 
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